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ASSOCIATED SETS OF POINTS* 

BY 

ARTHUR B. COBLE f 

INTRODUCTION 

Two sets of n points ordered with respect to each other, the one, P*, in a 
linear space Sk , determined by the equations 

(upi) = 0, (wp 2 )=0, -, (iip n } = 0, 

and the other Q?T k ~-, in a linear space S n -k-z, determined by the equations 
(w/i) = 0, (w/ 2 ) == 0, , (vq n ) = 0, 

are called associated sets if the factors of proportionality in the coordinates 
of the points can be so chosen that an identity in u, v exists of the following 
form : 

(1) ( upi ) Ogi ) + ( upz ) ( vq 2 ) + +( up n )(&lt;?) = . 

This relation, obviously mutual, between the two sets is such that either set 
uniquely defines the other to within projective modifications. Some general 
properties of such sets have been given by the writer. J 

A characteristic algebraic property of two associated sets is that comple 
mentary determinants formed from the matrices of the coordinates of the two 
sets of points when taken so that (1) is satisfied are proportional. A char 
acteristic geometric property is the following: On k + 3 of the points of P* 
there is a unique rational norm curve N k upon which the k + 3 points deter 
mine a set of k + 3 parameters; on the complementary set of n k 3 
points of Q?T h ~ 2 there is a pencil of linear spaces S n - k -3 whose members on 
the remaining k + 3 points determine a set of k + 3 parameters; these two 
sets of k + 3 parameters are projective. . 

Unless k n k 2 the associated sets are in spaces of different dimen 
sion. Conventional methods of passing from one space to another are the 
process of mapping the space of lower dimension upon that of higher dimen 
sion, and the process of projecting from the space of higher dimension upon the 

* Read before the American Mathematical Society at Evanston, Dec. 29, 1922. 

t This investigation has been pursued under the auspices of the Carnegie Institution of 
Washington, D. C. 

J A. B. Coble, Point sets and allied Cremona groups (I), these Transactions, vol. 
16 (1915), p. 155, in particular 1, 2 and theorems (25), (26); also (II), vol. 17 (1916), 
p. 345, 4 (16). These are cited as P. S. I or II. 
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2 A. B. COBLE 

one of lower dimension. Thus in the simple case of Pi, n points .TO, .TI 
(i =-- 1 , , n) on a line, the line is mapped by means of the totality of binary 
(n - 3)-ics in .r, i.e., by y = (a x) n ~ s , , y n - 3 = (a n _ 3 z)"~ 3 , upon the 
points y of a rational norm curve N n ~ 3 in S n _ 3 in such a way that Pi is mapped 
upon its associated $T 3 On the other hand Ql~ z is projected from any 
(S n _ 5 which is (n - - 4 )-secant to N n ~ 3 upon its associated Pi . 

Two problems considered in this paper are: When n k - = k can the 
space S k be mapped upon the space S n -h- i so that the set Pi is mapped upon 
the set $T"*~ 2 ?; when n - k - 2 &gt; k can the set Q^ k ~^ be projected upon 
the set Pi ? For k = 2 the first problem is solved in 1, the second in 2. 
For k --= 3 the first problem is solved in 3. For the general set Pi there 
appears to be no solution to the second problem and this probably would be 
true of further sets also. 

In 4 particular sets, i.e., those for which n , k have particular values, are 
considered. Each of these presents its own peculiarities. Also special sets, 
i.e., those which for given n, k satisfy in addition some projective conditions, 
receive some attention. Those conditions which are invariant under regular 
Cremona transformation of the set (cf. P. S. II, 4) are especially emphasized. 
Their form in the two sets is often very diverse. Thus if Pi is on a quadric 
with a node, then the associated Qt is on a rational quintic curve and con 
versely. In this section the discussion is carried through the values n : 10 . 

The results obtained for the sets of nine and ten nodes of the rational sextic 
and of the symmetroid are useful in connection with the author s investigations 
of the modular functions of genus four attached to these figures.* 



1. MAPPING or Pi UPON ITS ASSOCIATED 
The space 82 is mapped upon the space S n -4 by means of a linear system S 
of oo "~ 4 plane curves. The points of the plane are mapped upon a 2- way 
in S n -4 of order X where X is the number of variable intersections of two 
curves of S . The intersections of this 2-way by the linear S n -$s contained 
in (S n _4 correspond in Sz to the curves of S . We have therefore to find a 
system S so related to the set of points PI that the additional condition that 
three points of Pi are on a line has as a consequence that there must exist a 
curve of S on the remaining n - 3 points of Pi and therefore also that the 
corresponding n 3 points of Q^~* lie upon an S n _ 5 in S n _ 4 . This ensures 
the proportionality of complementary determinants in the matrices of the two 
point sets. Of course this requirement may not define the system S and we 
seek merely a simple system S with the required property. 

* A part of this work appears in abstract in the Proceedings of the National 
Academy of Sciences, vol. 7 (1921), (I) p. 245; (II) p. 334. These are cited as 
Proc. I or II. 
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The cases where n is even and n is odd are slightly different and we begin with 
the mapping of a Piy+a upon its associated Q^+l . In S 2 pass through Pfy+s 
a proper curve C of order j with a (j 2)-fold point at a point r. Also pass 
through Plj+s a proper curve D of order ( j + 1 ) with a ( j -- 1 )-fold point at 
r which meets C in (2j 5) points s^, s 2 , , s 2 j-r,. If LI, L^ are distinct 
lines on r , then D , CL, CL Z cut out the same set S upon C, so that the set S 
lies in an I]L~ 3 5 on C . The choice of the points r , s thus depends upon 2j 5 
constants when P^j+s is given. Let A , B respectively be arbitrary sets of 
(j 2) and (j 3) lines on r. Then in AC + BD = we have a system 
of curves of order (2j 2) with a (2j -- 4)-fold point at r, on Pjy+s, and on 
the points S . The parameters in A and B are essential. For if AC + BD 
= A C + B D then (A - A )C = (B - B ) D , whence A = A &ndB^B , 
since C and D are proper curves. Thus the system AC + BD = contains 
co 2 ^ ~ 4 curves. If three points of P^j+s are on a line, a curve of the system can 
be passed through (2j -- 4) further points of the line, which therefore will 
contain the line as a factor. The complementary factor will be a curve of 
the required system S of order (2j 3) with a (2j -- 4)-fold point at r and 
on the set S, and this curve will pass through the complementary set of 2j 
points of PI/+3 . Hence the system 2 will map the set Plj+s upon its associated 
set. 

For the case n even, or a Plj+2 , we pass through Pfy+2 two proper curves 
C , D of order j with a common (j 2)-fold point r which meet again in 
( 2j 6 ) points S . Here the choice of r , s depends upon 2j 6 constants. 
Let A, B be arbitrary sets of (j - 3) lines on r. Then A AC + BD = 
we have a linear system of dimension (2j 5) of curves of order (2j 3) 
with a (2j 5)-fold point at r, on Pfy+2, and on the set S. If three of the 
points of PZJ+Z are on a line, one curve of the system contains this line as a 
factor, whence one curve of the required system 2 of order (2j --4) with a 
( 2j 5 )-fold point at r and on S will pass through the complementary set of 
( 2j 1 ) points of P-J/+2 . This system S therefore effects the required map 
ping. Hence 

THEOREM 1. The plane set of points Pi is mapped upon its associated Q%~* 
by a linear system of curves of order (n 6) with an (n 7)-fold point at r 
and on a set of ( n 8 ) points S in such a way that the plane is mapped upon 
the normal 2-way, M%~ 5 , of order (n 5 ) in S n - 4 // n is even the points S 
are the further intersections of two proper curves of order ( n 2 )/2 with a common 
(n 6) /2-fold point at the arbitrarily chosen point r and on the given set P%. 
If n is odd the points S are the further intersections of two proper curves of order 
(n - 3)/2 and (n -- l)/2 with respectively (n - 7)/2- and (n - 5) /2-fold 
points at r and on Pi. For given Pi the choice of the points r , S depends upon 
(n 8) constants. 
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The mapping described above becomes evanescent for n = 6 and n - 
In the case of P\ let a pencil of cubics on P\ meet again in s\ , s 2 , s 3 . Then 
conies on S map Pi upon its associated Ql. For if three of the points of Pi 
are on a line, the complementary three are on a conic with Si , s 2 , s 3 and there 
fore map into three points of Ql on a line. Hence 

THEOREM 2. Six corresponding point pairs of a quadratic transformation 
are associated Pi , Ql if Pi and the singular triangle of the transformation are the 
base points of a pencil of cubics. 

In the case of P? \ve pass a pencil of cubics through P? to meet again in 
Si , s 2 . Then conies on si , s z map Py upon its associated Q? in S 3 . In 
this mapping the plane becomes a quadric on Q? and the points on the line 
Si s 2 become the directions on this quadric about the eighth base point of the 
net of quadrics on Q* . Thus to the o 2 possible choices of the pair s\ , s^ 
there correspond the set Q% and the 2 quadrics on it. 

We observe also that the cases n = 8, n == 9 are exceptional in that for 
PS r is the ninth base point of the pencil of cubics on Pi and that for Pi r is a 
point on the cubic determined by Pi. For further cases r may be taken in 
general position. 

2. THE PROJECTION OF Q% +i UPON ITS ASSOCIATED Pl + ^ 

We now consider the set Q* +4 as given in Sk and ask for spaces L of dimension 
k 3 such that under projection from L , the set Q will become its associated 
set in the plane. Two lemmas are needed. 

LEMMA 1. The Sk-z ir determined by L and q\ is a (k I }-secant space of 
the norm curve N* on &lt;/ 2 , , &lt;?A-+4 

For if T is the parameter of the pencil of St-i s on TT and t the parameter on 
N* the incidence condition of Sk-i T and point t is a ( 1 , k ) relation on r , t 
which in general would have only k + 1 pairs T , t in common with any (1,1) 
relation on T , t . If this (1,1) relation is the projectivity mentioned in the 
introduction between the parameter r of the line pencil on p\ in S 2 and the 
parameter t of N\, then it is satisfied by the k + 3 pairs t , T determined by 
g 2 , , q k+ 4 . Therefore the projectivity determines a ( 1 , 1 ) relation which 
is a factor of the ( 1 , k) relation. The complementary factor of degree k -- 1 
in t determines the points of N* on TT . Thus the k + 4 norm curves on the 
sets of k + 3 points q selected from Q^+t are projected from L into k + 4 
rational k- ics in the plane on the points of Pj +4 and with respectively a 
( k - 1 )-fold point at each point of Pl + . This remark is utilized in 
Theorem 5. 

LEMMA 2. Quadrics on q z , , gt+4 cut TT in quadrics apolar to a unique 
quadric Q* in TT and L in TT is the polar Sk-z of qi as to Q, . 

For the ( 2 ) linearly independent quadrics on N* cut TT in ( \ ) sections on 
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the k -- 1 points common to TT and N^, whence of these only () (k 1) 
are linearly independent in TT. Therefore k -- 1 quadrics on A T * contain TT 
and the ( /c | 2 ) -- (k + 3) quadrics in S* on q , , q k+ t cut TT in at most 
(*2 2 ) -- (A- + 3) -- (A- 1) == (I) -- 1 linearly independent quadrics all of 
which are apolar to at least one quadric Q* in TT . Moreover the *S 2 on three 
points of 72 &gt; , 7t+4 and the Sk-i on the remaining A- points meet TT respec 
tively in a point and S k -s which are pole and polar as to Q* and thereby Q* 
is uniquely determined. For any S k -i on S 2 together with the given Sk-i 
constitute a quadric on qi , , q/c+4 and meet TT in a pair of /Sjt_ 3 s apolar to 
Qv . Finally, if three points of q 2 , , q k+i are in an Sk-i with L and therefore 
project from L into three points of a line in S 2 , then the remaining k points and 
f/i must be in an S k -i which meets TT in an S k -s on q\ . Hence the point, 
Sk-x of TT mentioned above are such that when the point is on L then the 
Sk-a must be on q\, which requires that q\ , L be pole and polar as to Q w . 

In order to put all the points of the set Q on the same footing we now prove 

THEOREM 3. Given Q%+* in S k there exist oo k ~ z spaces L of dimension k 3 
such that all the quadrics on L and any k + 3 of the points Q meet again at the 
remaining point of Q, or also such that all the quadrics on the points Q and 
( k ~2 l ) -~ 1 points of L contain L. From any one of these spaces L the set Qk+t 
is projected into its associated P l +i . 

For there are &lt;x&gt; k ~ : Sk-z s which are (k - 1 )-secant spaces of A 7 * each with 
oo fc ~ 2 points, so that on q\ there are &lt;*&gt; A "~ 3 such spaces TT . In any such space TT 
choose L to be the polar S k - 3 of qi as to the quadric Q T determined as in Lemma 
2. Then all the quadrics of S k on g 2 , , &lt;?Jt+4 which contain L cut TT in 
another S k -3 on q\ and L has the first property described in the theorem. 
That all the Sk-a s L of the theorem are found among the (A- 1 ) -secant 
spaces TT of Nl on (?i is proved as follows. If, as given, quadrics on g 2 , 
q k+i and L meet again in q\, then the ( k 2 2 ) (^i" 1 ) ~ (A + 3) = 2A- 3 
linearly independent quadrics of this sort meet TT [L, qi] in a linear system of 
SANS S on pi of which only A 2 are linearly independent in TT . Hence A - -- 1 
of the quadrics contain the Sk-z T and therefore meet in a N* (necessarily on 
q 2 , , qk+*) which is ( A- -- 1 )-secant to TT . We observe that the configura 
tion Qk+i, L is the generalization of the set of eight base points of a net of 
quadrics as one of the points is enlarged in dimension. To prove the last 
statement in the theorem we note that if &lt;? 2 , , qk+z are on an Sk-i , this 
Sk-i together with the Sk-i on L and qk+s, ?H-4 constitute a quadric which 
must contain &lt;?i, whence in the projection p\ , p k +3 , Pk+t are on a line. Here the 
isolated position of pi is not material. 

The above discussion suggests the following construction for the set in Sk 
w r hen the set in the plane is given. 

THEOREM 4. Given the set Pj+4&gt; hi the parameter t of the line pencil on p\ be 



6 A. B. COBLE [July 

introduced as a parameter on the linear system Si of &lt;x&gt; k ~ 3 rational curves of 
order k with a (k -- 1 )-fold point at pi . Then h, , t k +i are the parameters 
of P2, , pk+4 on every curve of Si and the parameters of the multiple point pi 
determine a linear system of &lt;x fc - 3 binary (k l)-ics all of which are apolar to 
a binary k-ic, 7* . In Sk select a parameter system t on a norm curve N* . Then 
the points of N* with parameters t 2 , , t k+ i and the point of Sk determined by 
7i with reference to JVJ constitute a set q z , , q k+i , q\ associated with P* +4 . 

This is indeed an immediate consequence of the fact that the curves of Si 
are the projections of A 7 * from the k " 3 spaces L . This same projection and 
the further fact that the choice of a single curve of the system Si is sufficient 
to determine the corresponding L lead to the following theorem, which is not 
readily apparent from the plane figure alone. 

THEOREM 5. The k + 4 systems S,- of dimension k 3 of rational curves 
of order k with a (k I )-fold point at p t and simple points at the remaining points 
of P%+4 are in one-to-one correspondence with each other. 

We shall see in 4 that for Q? the co == 1 space L is the point common to 
all of the oo 2 elliptic quartics on Q?; for Q| the oo 1 spaces L are the common 
bisecants of all the &lt; * elliptic quintics on Q| ; and for Qg the =c 2 spaces L are 
the trisecant planes of the unique elliptic sextic on Qg . For further sets no 
equally simple characterization of the spaces L has been obtained. 

3. MAPPING OF Pi UPON ITS ASSOCIATED Q~ 5 

In order to map a set P\ upon its associated Ql we need only to find a further 
set Pi such that the set P? 4 == Pi + Pi shall have the property that the linear 
system S of cubic surfaces on the 14 points shall have the dimension 6, i.e., 
that all the cubic surfaces on 13 of the points shall pass through the 14th. For 
then if 4 of the points of Pi are in a plane TT a cubic surface of the system S 
can be made to pass through 6 more points of TT in general position and there 
fore to contain TT as a factor. The remaining factor is a quadric on Pi which 
contains the other four points of Pi. Hence the linear system of quadrics 
on Pi will map Pi upon its associated Ql . 

One symmetrical set of 14 points of such character may be obtained as 
follows. Given 6 points TI , , r 6 of a plane, select a quartic curve with 
simple points at r and an octavic curve with triple points at r. These two 
curves meet elsewhere in 14 points. They are mapped from the plane by 
cubic curves on the points r into two space sextics of genus 3 with 14 common 
points. The two space sextics are on one cubic surface the map of the plane 
and only one since the two sextic curves could not lie at once on two cubic 
surfaces one of which is non-degenerate. Since each sextic curve is on 4 
linearly independent cubic surfaces, there must be on their 14 common points 
4 + 4 -- 1 = 7 linearly independent cubic surfaces and the set has the required 
property. 
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The trisecant locus of the one sextic an octavic surface with the sextic 
as a triple curve meets the other sextic in 8X6 14X3 = 6 points, 
whence six trisecants of each curve are secants of the other and these two sets 
of trisecants are a double six of the unique cubic surface on both sextics 
the double six of the mapping system. The roles of the two plane curves are 
interchanged by the plane Cremona transformation of order 5 with double 
P-points at the six points r . We observe that the pair of space sextics is the 
complete intersection of a cubic and a quartic surface. 

The number of absolute constants is 4 for the points r and 8 more for each 
of the plane curves, or 20 in all. Hence in space such a set of 14 points has 
20 + 15 = 35 projective constants. A space sextic of genus three has 
15 + 9 = 24 projective constants so that on a given sextic there are &lt;x&gt; n 
such sets of 14 points which lie in a linear series I\\ . From this there follows 
that at most 11 of the 14 points can be chosen at random in space. For such 
sets from Pi to P 3 n we have 

THEOREM 6. The three-dimensional sets Pi, PI, P? , an d PII canbe mapped 
upon their associated sets Ql , Qg , Qio , and Qu by the linear system of quadrics 
on a supplemental set Pi, Pi, Pi, and PI respectively, which ivith the given 
set makes up the 14 points of intersection of two space sextics of genus three. 

The mapping system of this theorem is more general than is needful for the 
purpose. Consider for example the set Pi . It lies on a unique elliptic quartic 
E 4 , the intersection of quadrics Qi, Qz- Let C be a cubic surface on Pi which 
cuts E 4 in a residual set P 4 . Let two other points in general position be a 
set P 2 . The totality of cubic surfaces on the 12 points Pi + P 4 is made up 
of C + irQi -f- ir Qz where IT , IT are arbitrary planes. In this system of c 8 
surfaces there is a system of dimension 6 on Pi + P 4 + P 2 , whence quadrics 
on P 4 + P 2 map Pi upon its associated set Ql. This mapping is however a 
degenerate case of Theorem 6, since E 4 and a bisecant of E 4 from each point 
of P 2 make up a degenerate sextic of genus three. 

The simplest transition from Pi to Ql is obtained by taking Pi on an E 4 with 
canonical parameter u (i.e., such that the coplanar condition is u\ + u 2 + its 
+ i/4 = mod. wi , to 2 ) for which the parameters of the points of Pi are MI , 
, u s , where S?tt = a . If now we set Ui + Vi = cr/4 (i ---- I, , 8 ) then 
vi + + Vi = cr -- (HI + + M 4 ) = u- a + + u & . Hence the four 
points v are on a plane if the complementary four points u are on a plane, or 
the set v is associated to the set u. The lines joining u { , v&lt; are generators of 
a regulus on E 4 . For given Pi the cr/4 has 16 determinations, whence 

THEOREM 7. For a given set Pi there are 16 reguli on the E 4 through Pi 
such that the generators of a regulus on the points of Pi meet the E 4 again in the 
points of an associated Ql . 

Again let the set Pi be on a quadric with generators t, r and let (ar}- (at} 5 
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= and (6r) 3 (j30 2 = be two quintics of genus two of different kinds on 
PI and Q . These quintics meet in four other points PI on Q . Let Pi be a 
point in general position. Then if C\ , C{ are cubic surfaces on 1 the first 
quintic, C z , C 2 cubic surfaces on the second quintic, and TT is an arbitrary 
plane we have in Xi C\ + X 2 C( + X 3 C 2 + X 4 C" 2 + nQ a system of 7 cubic 
surfaces on Pi and P 4 . Hence there will be a system of dimension 6 on Pi , 
P 4 , PI, or the system of quadrics on P 4 + PI will map Pg upon ^9 . This 
again is a special case of Theorem 6 since a bisecant to the one quintic from 
PI makes up with the quintic a degenerate sextic of genus three and the two 
sextics thus made up have 14 common points. We shall however find in 4 
a different mode of transition from P| to Q% which exhibits more effectively 
their mutual relations. 

There appears to be no point in 84 from which a general set Qg can be pro 
jected into its associated set. If Qg is on an elliptic quintic E b (two conditions) 
a quadric on Qg will cut E 5 in a tenth point from which the desired projection 
can be made ( 4, Theorem 11). However, no general sets except planar sets 
have been found which are the projections of their associated sets. On the 
other hand no proof of the impossibility of such a projection has been found. 
We complete the mapping of sets P upon their associated sets by means 
of an apparatus derived from the elliptic curves. Let be an elliptic curve 
of order m &gt; k in an Sk It is the projection of the normal E-i from an 
S m -k-2 . The Em-i has one absolute constant and the S m _t_ 2 in S m _i has 
( m k 1 ) ( k + 1 ) further constants, so that the projection has ( m k 
-1 )(&+!) + ! absolute constants. This number added to the ( k + 1 ) 2 
- 1 constants of a projectivity in Sk furnishes m ( k + 1 ) . Hence the elliptic 
m-ic in Sk, Ef, has m(k + l) constants and can be passed through 
[m (k + 1 )/(A- -- 1 ) ] points in Sk, where the bracket indicates the largest 
integer equal to or less than the number within it. 

Since r-ic spreads cut the E in an Im r r -i, an r-ic spread on mr general points 

of contains it completely. Hence there are oo ( 1 r-ic spreads on 

E% and there are co ( * ) " r -ic spreads on the mr points cut out on E by a 
definite r-ic spread. 

Beginning then with a set P\ } we can pass an E{ through its points. Let 
an r-ic surface on Pj^ meet Ei in j(r 2) further points Pj (r -2) . Then 
there are co ( s ) r r .[ c surfaces on P|, + Pj(r-z) If we suppose that these 
surfaces are subject to a ^ further linear conditions, say to pass through 

a set of points P a , we have a linear system of &lt;*&gt; ^ 3 )~ : " r-ic surfaces on the 

base Plj + Pur-*) + P a If 4 points of P\ } are on a plane and if ( r+ 3 3 ) - jr - a 

= ( r t 2 ) - - 4 , then an r-ic surface of the linear system can be determined which 

contains this plane as a factor leaving an (r - 1 )-ic surface on Py (r _ 2) + P a 
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which passes through the remaining 2j -- 4 points of P^. This condition 
becomes 

(2) (T) -jr + 4--a. 

Since a ^ 0, then, for given j , r is defined by the inequality 

(3) ( **)+ 4 S jr. 

The modification for an odd set P&lt;y-i is readily made and we state at once 

THEOREM 8. Through a given set Plj{Plj-i\ pass an Ei and cut it by an r-ic 
surface on P|/{-Piy-i} which meets 3 again in a set Py( r _ 2 ){Pjo-2)+i| where r 
is the smallest integer defined by (3). The linear system of surfaces of order r 1 
on this residual set and on a further general set P a , inhere a is defined by (2), mitjix 
83 upon a %-way in fi^y-sl Say-el in such a way that the set P^jPi^-i} is mapped 
upon its associated Q2J ~ 5 {Q2;-i! 

For the sets Pg and PJ the numbers j , r , a are 5, 4, 4; for P^ and P\ z , 
6,4,0; for P? 3 and P? 4 , 7, 5, 4; etc. 

4. PARTICULAR AND SPECIAL SETS OF POINTS 

f 

It is the aim in the present section to consider in more detail the relation 
of particular sets P for values of n from 8 to 10 to their associated sets both 
for cases when the n points of the set are in general position and for cases 
when they are subject to certain conditions. A question naturally arises as 
to what types of conditions would be most interesting and as to what types 
of configurations connected with the associated sets would best exhibit the 
relations sought. In answer to this inquiry we recall the noteworthy theorem 
in regard to associated sets (P. S., II (16), p. 361), which states that if P 
and P n are congruent under regular Cremona transformation in Sk their 
associated sets Qn~ k ~ 2 and Q n "~ are also congruent under regular Cremona 
transformation in S n -k-2 More specifically, if P is congruent to P^ under 
the Cremona involution .r = l/.r; (i = 1, , k + 1 ) with its k + 1 
P-points at points of P, then Q^ ~~ is congruent to Q n n ~ under the involution 
x i = l/Xi (i = 1, , n k 1 ) with its n k -- 1 P-points at the 
complementary n k 1 points of Q%~ k ~ 2 . The regular Cremona group is 
generated by this one Cremona involution and projectivities. 

We shall seek therefore to express the desired relations in terms of such loci 
or in terms of such properties of these loci as are invariant under regular 
Cremona transformation. Thus a rational curve, or an elliptic curve, of order 
k + 1 on the points of P is transformed by regular transformation into a 
curve of the same order on the points of the congruent set. The same is 
true of multiples of such curves, i.e., curves of orders l(k + 1) with /-fold 
points at the points of P, if such curves exist. This property of invariance 
is shared by a certain type of surface the rational M 2 in S r +i . We shall 
first derive some facts concerning this surface for later use. 
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If r == 21 + 1 [21] the system of rational plane curves of order / + 1 on 
the base O l [O l , a] has the dimension r + 1 and maps the plane upon a 2-way 
of order r , M r 2 , in S r+ i . Each of these surfaces is the projection of the one 
of next higher order from one of its points. This is evidently the case in 
passing from the base O l to the base O l , a . But also the base O l , a- , a can 
be reduced by quadratic transformation to the base O l ~ l . Thus the series of 
surfaces M 2 constitute the progenitors of the quadric A/1 in &lt;S 3 . Lines on the 
point map into the co 1 "generators" of the surface. 

In case r is odd directions at map into a unique "directrix," a rational 
norm curve of order /; while the lines of the plane map into &lt; 2 "directors," 
rational norm curves of order Z + 1 . Since S r s on the directrix are mapped 
by sets of I + 1 lines on , and S r s on a given director by sets of I lines on 
and a given line, the directrix and a director are in skew Si, S;+i, and the 
generators are lines joining corresponding points of these two rational curves. 
Included, however, among the co 2 directors are the oo 1 which consist of the 
fixed directrix and a variable generator. 

In case r is even there are oo 1 directrices, the maps of lines on a , which 
are rational norm curves of order / . Included in this system is one curve 
which is the map of directions at . As before the co l generators are the maps 
of lines on but this system includes the one line which is the map of directions 
at a . The line Off is mapped into directions on the surface about the point 
where the generator a meets the directrix . If TT , p are two lines on a the 
mapping system can be expressed in the form TT-I + p2 2 where 2i, 2 2 each 
is the system of I lines on . Hence any two of the directrices lie in skew 
Si s and the generators are lines joining corresponding points on the two. 

In either case by estimating the number of constants involved in the choice 
of the skew spaces; in the choice of the rational curve in each; in the projec- 
tivity between the two curves set up by the generators; and by allowing for 
the freedom in the choice of the skew spaces for given surface, we find that the 
number of projective constants of the M 2 is (r + 2) 2 - 7, whence the M 2 
admits a 6-parameter collineation group. This group for r odd is the map of 
the 6-parameter collineation group of the plane with fixed point 0; for r 
even it is the map of the 6-parameter quadratic group with fixed / -points 
at , a . 

Since it is r 1 conditions that an M 2 in S r+i be on a point, we see that 
there are o 2 M 2 s on r + 5 points in general position. Thus on 8 points in 
S 4 there are &lt; 2 Mi s, or on 9 points a finite number; on 9 points in S- a there 
are oo 2 M 2 s which fill up a spread, whence for 10 points in S 5 there is a single 
condition invariant under regular Cremona transformation which expresses 
that the 10 points lie on an M* . 

The system of plane rational curves of order I on the base O l ~ l [O l ~ l , a] 
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has the dimension r 1 . Let d ( i 1 , , r) be linearly independent 
in this system and let TT , p be two lines on . If then we set ?H , = irCi , 
ni = pd, where nii, M; are the linear forms in S r+ i which cut M r 2 in the maps 
of the given plane curves, we find that the equation of M 2 is 

(4) Wl 2 Wr =0. 

KI n-i n r 

Conversely a manifold in S r+ \ defined by such a matrix is in general an M 2 
mapped as above. 

In the case r = 21 a parametric equation of M 2 is 

(5) .TO = (a Q t) (GOT) , .TI = (i t) (i T } l , , x r+ \ = ( a r+ i t) ( o r +i r ) . 

For given r we have one of the *x&gt; 1 generators; for given t one of the oo 1 
directrices. In the case r = 21 + 1 the parametric equation is 

(6) a-o = (aoO(oT) +1 , .TI == (: (a, r) +1 , 

.r r+ i = (av+iO (a r +iT) m , 
where 

(a&lt;O(OiT / ) m = (i = 0, -, r+ 1). 
This is in fact the projection of (5) for r = 21 + 2 from a point t , T upon it. 

Special cases of these rational surfaces occur. Thus cubic curves on the base 
O 2 , o , o map the plane upon an Ml in 84 . This mapping system can be 
reduced to conies on the base by quadratic transformation with F-points 
atO,o,o unless o , o coincide with in two distinct directions. Thus cubics 
with node at and fixed nodal tangents determine an If i in S 4 which is more 
properly the projection of an M 2 in S 6 from two points on its directrix conic. 
This special J/l is obtained in S 4 by joining a point directrix to a cubic curve 
director. Unless expressly mentioned special M 2 s of such types will not be 
considered. 

We shall now prove 

THEOREM 9. An M 2 in S r+ \ is transformed by the Cremona involution 
x i = 1/j; (i = 1 , , r + 2) with r + 2 F-points on the M 2 into an M 2 . 

The space x is mapped in the involution upon the space x by the system 
of spreads of order r + 1 with r-fold points at the F-points which are the 
maps from the plane of the points j)\ , , p r+2 Then, for r = 21 + 1 , the 
transform of M 2 is mapped from the plane by curves of order 2 ( / + 1 ) 2 with 
a 21(1 + l)-fold point at and (21 + 1 )-fold points at pi, , p-n+z , for 
r = 21, by curves of order (/ + 1) (21 + 1) with an 1(21 + l)-fold point at 
0, a (21 + l)-fold point at a\ and 2/-fold points at p 1} , pzi+z- We 
have merely to show that the two latter mapping systems can be transformed 
by ternary Cremona transformation into systems of order I + 1 on the bases 
O 1 or O 1 , cr respectively. For odd r this transformation is effected by using 
first the Jonquiere transformation J l+l of order I + 1 with Wold point (center) 
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at and simple F-points at pi , , p z i , then a quadratic transformation with 
P-points at p z i+i, PII+I, Pii+z, and finally the transformation J m again. 
For even r we use first a quadratic transformation with / -points at , a , pi , 
then a J l with center at and simple P-points at p z , , PII-I, then the 
quadratic transformation with P-points at p&lt;u, pii+i, P2/+2, and finally ./ 
again. It is easily verified that these transformations effect the required 
change in the mapping system and the proof is complete. 

The three theorems which follow relate to special sets of points when, for 
given k , n is sufficiently large. 

THEOREM 10. // Pn is on a rational norm curve N k in S k , then its associated 
QT k ~ 2 is on a rational norm curve N n ~ k ~ 2 in S n - k -t The n parameters of the 
tu o sets on their respective norm curves are protective. If ( n k 2 ) k 
= I + 1 &gt; , the set Q is projected n/xm the set P from /mi/ one of the H 
spaces LI which are (/ + 1 )-secant to N"~ k ~ 2 . 

THEOREM 11. // Pn is on &lt;tn elliptic norm curve E k+l in St, then its associated 
Q%~ k ~ 2 is on an elliptic norm curve E n ~ k ~ 1 in S n -k- Z . // ( n k 2 ) /; 
= / + 1 &gt; 0, the set Q is projected upon the set P from any one of the co sjmces 
LI which are ( / + 1 )-secant to E n ~ k ~ l at the I + 1 jiuinf* nit out by a quadric on Q . 

THEOREM 12. // Pn is on a rational norm surf ace J/ij" 1 in Sk,then its asso 
ciated Q n n ~ k ~~ is on a rational norm surface NT k ~ 3 in $ n -k- Then parameters 
of the two sets of generators on the points are projective. 

In Theorem 10 let the norm curves in Sk and S n -k-z have the respective 
parametric equations 

3-0 = 1, 3-1 = t , , X k = t k ; 

a-o = l, 3-1 = t, , Xn-k-z = t n ~ k ~ 2 ; 

and let the sets P n , Qn be determined on these curves by the parameters 
ti, - ,t n . If Xi , , A n are determined by the n - - 1 equations 

Xi/{ +A 2 ^+ +X B /i = (i = 0, 1, ,11 - 2), 

then the points of the one set, affected respectively by factors of proportion 
ality AI , , \ n , satisfy with the points of the other the bilinear relations 
requisite for association. We observe that here P^ is obtained by projection 
of Q^T* from oo n ~ 6 spaces LI rather than &lt;x n ~~ 7 spaces as in the general case of 
Theorem 3. 

In Theorem 11 let the canonical parameters of P k on E k+l be u lt , u n 
where u\ + + u n + b = 0. Choose then a mapping system on a base 
B such that the members meet E k+1 in n k -- 1 variable points and also 
in a certain number of fixed points whose parameters sum up to b . Then, 
if k + 1 points of P* are on an S k -i ,!++ Uk+i = and n k+2 + 
+ u n + b = 0, whence the complementary n k I points of P are on a 
member of the mapping system or the n k 1 points of QT k ~ 2 , mapped 
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from Pi, are on an S n _ fc _ 3 . Thus E k+l is mapped upon E n ~ k ~ l and P* is 
mapped upon its associated Q n n ~ k 2 . In this way we find upon each of the 
associated sets P?, Q%, 2 elliptic norm curves, upon each of the associated 
sets Pi, Qg, co 1 elliptic norm curves, and upon each of the associated sets 
Pg , Qg, a unique elliptic norm curve. 

For Theorem 12 we give the details of the proof only for the case k = 21 + 1 . 
Then Afir 1 is the map of the plane by curves of order I + 1 on the base O l , a 
and Pn is the map of a set irl in the plane. If n == 2m + 1 then is the center 
and a , TT"^ the simple /"-points of a J m+ ~ whose inverse center and ^-points 
are , a- , ir n . Curves of order m I -- 1 on the base O m map the 
plane on an M%~ k ~ 3 and map ir n 2 upon a set Ql~ k ~ 2 which is associated to P* . 
For if k + 1 points pi , , p 2 i+i of P* are on an Sk-i there is a curve of order 
I + 1 with /-fold point at and simple points at cr , TT\ , , 7^+2 This 
curve is transformed by J m+2 into a curve of order m I I with (m I 2 )- 
fold point at and simple points at TT ZI+S , , ir zm+i Hence the points 
&lt;?2z+3 , , q n are on an S n -k-z in S n _ t _ 2 . If, however, n is even we take a , a 

to be a pair of ordinary corresponding points for a J 1+ 2 . 

It should be observed however that an A/1 in S 3 , an ordinary quadric, counts 
in two ways as a ruled normal surface. It is mapped from the plane by conies 
on , &lt;r and as the points are interchanged in the above proof two normal 
surfaces in S n - k ^ are obtained. Hence 

THEOREM 13. // P% is on a quadric surface which is not a cone, its associated 
Q^T is on two normal M^~ 6 -s in S n _ 5 . 

A simple statement of the relations among the P-points of a Jonquiere 
transformation can be given in terms of associated sets. 

THEOREM 14. Given the Jonquiere transformation J n+1 with center at p and 
simple F-points at P\ n , then curves of order n 2 with an (n 3)-/oM point 
at p map the plane upon an Ml n ~ 5 in S 2 n-4 and map the set P\ n upon a set Ptl~ 4 
which is associated to the set Q% n of simple F-points of the inverse transformation. 

The proof of this is immediate by the foregoing methods. 

We now proceed to particular sets beginning with Pi , Q$ . The o 1 elliptic 
quintics, E b s, on Q| are obtained by the mapping of P| on Q| by conies on the 
9th base point p g of the pencil of cubics on Pi . This pencil becomes a pencil 
of E d s on an M\ on Q| and the generators of M\ , which arise from the lines 
of the plane on pg, are bisecants of all these 5 s. However, each of the co 1 
5 s on Qs has oo 1 A/J s on it, whose generators on points v\, v 2 satisfy the 
involution vi + v z = k .* That particular A/1 common to all the E~ s is 
determined by the involution cut out on any E* by quadrics on Q| . For if, 
in the plane, MI + u^ + +u& + ug = 0,vi+ + v & + u* = , u i + w 2 
+ Ug = represent the sections of a cubic of the pencil by respectively a cubic 

* Segre, Mathematische Annalen, vol. 27 (1886). 
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of the pencil, a mapping conic, and a line on p 9 , then, on writing the second 
relation in the form (vi + -^9) + + (vs, + \ug) = in order to introduce 
the canonical parameter v = v + \u 9 on the mapped E , we have for Q & 
and the meets of a generator of the unique Ml the relations 

(Ml + i9) + + Us + &gt; 9 ) = |9, ( l + i"9) + (" 2 + i9) = -- |"9, 

whence on E 5 v{ + + v B + w i + w 2 = and the ten points are a quadric 
section. 

We may relate Q% and any one of the oo 2 Mi s on it to Pi in the plane as 
follows. Let Pi, Rl be P-points of a J 5 with centers at p, r where p is any 
one of the oo 2 points of the plane. Then if p\, p 2 , ps are on a line, the points 
r^, , r% , r are on a conic. Hence conies on r map the plane upon an 3/ 2 
in St in such a way that Rl is mapped upon the set (?| associated to PS 

In addition to the oo 1 5 s on Q$ there are = 2 rational quintics R b on Q%. 
These are in one-to-one correspondence with the o 2 3/1 s on Qg. For, given 
an M\ on Q|, of the 7 linearly independent quadrics on Ql three are on 3/1 
(the three determinants of the matrix (4)) and of the remaining four one is 
on the directrix of 3/1 and cuts 3/1 in a residual R b trisecant to the directrix 
and unisecant to the generators. Conversely, given an R 5 on Q% it has a 
unique trisecant (with parameters determined by the canonizant of the binary 
quintic apolar to all S 3 sections) whose points are in 1-1 correspondence with 
the points of the curve (the correspondence being determined by making the 
three points common to the curve and trisecant self-corresponding) and the 
lines joining corresponding points are generators of an Ml on Q$ . The ques 
tion then arises as to the nature of the spread which is the locus of the =o 2 
R b s on Ql or the nature of the condition that a $9 be on an Pi 5 , and as to the 
corresponding condition on the associated Pg . The two theorems which 
follow answer these questions. 

THEOREM 15. There are two Mi s on a given Qg lehich are covariantly related 
to the set under regular Cremona transformation. They are isolated by the same 
irrationality as separates the two reguli on the unique quadric on the associated 
set Pg . The parameters of the 9 generators of one of the Mi s on Qg are pro- 
jective to those of the 9 generators of one of the reguli on Pg . // the set Ql lies 
on an R 5 (a single condition) then it lies on but one Ml and its associated Pg lies on 
a quadric cone. 

Two Mi s in S 4 meet in a set Qg . That on Qg there are two 3/1 s is proved 
by Theorem 13. That there are only two is proved as follows. The oo a 
3/1 s on #t are loci of &lt;x* bisecants of the oo 1 E 5 s on #|. One of these 3/1 s, 
say ml, is a locus of bisecants of each of the 5 s; the others are each a bisecant 
locus of only one E b . If then an 3/1 is on a 9th point q 9 there is a bisecant of 
an E&gt; on &lt;? 9 ; if two 3/i s are on &lt;? 9 their plane cuts ml in the 4 meets of two 
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bisecants with their respective E s. Hence this plane cuts ml in one of the 
director conies on it. A third bisecant on Q g would have to be in this plane 
else there would be two director conies with two intersections, whereas such 
conies have only one. A director conic meets each of the oo 1 E s in three 
points and on this conic there is an involution of triads whose joining triangles 
envelop another conic. Hence on the point q g in the plane of this conic there 
are just two lines of this envelope each belonging to one of the two Mi s on Qg . 

If Qg is on an R 5 which must lie on one of the two Mi s on Qg and must 
cut its directrix in three points and each generator in one point, then in the 
notation of the proof of Theorem 12 the R 5 must be the map of a rational plane 
quartic with triple point at and on IT S , , TTg as well as on a . But then 
a must coincide in some direction with 0, and the quadric on Pi mapped by 
conies on , a is a quadric cone. 

An E 5 in S$ is projected from a line into an elliptic plane quintic with five 
nodes and from a line which meets E &gt; into an elliptic plane quartic with two 
nodes, whence the bisecant locus of E b is a quintic spread on which E* is a 
triple curve. The &lt;x&gt; 1 E r s on a given Q% can be put into 1, 1 correspondence 
with a pencil of plane cubics and therefore can be named rationally in terms 
of a parameter X . Through a point there pass two bisecants belonging to 
two of these E b s, whence the aggregate of these bisecant spreads of the &lt; l 
5 s constitute a quadratic system. The two bisecants isolate the two Mi s 
on Qt and the given point, whence if they coincide the two Mi s coincide and 
the given point and Q% are on an R b . Hence 

THEOREM 16. // X 2 B + 2\Bi + 5 2 = is the quadratic system of bisecant 
spreads of the oo 1 E 5 s on Q%, the spread B\ B 5 2 = (a spread of order 10 
with Q-fold points at Ql and a double Ml consisting of the oo 1 E 5 ,&lt;?) is the locus of 
the o 2 rational quintics on Q\, or the locus of points through which there can be 
drawn but one line bisecant to an E 5 on Q\ , or through which there can be passed 
but one Ml on Q|. Its equation may be obtained by replacing in the condition 
that a quadric on Pi be nodal (a condition of degree 8 in the coordinates of each 
point of Pi whose terras consist of products of 18 determinants p; PJ pkpi\) 
each determinant \p il p; 2 p,- 3 p,- t \ by the complementary determinant 1 5 r i e ? 8 g r t 7 5 r . 8 9, | 
formed for Qg and alloiving the Qth point to vary. 

Here then we have an instance of the actual determination of a covariant 
of Qt or an invariant of Qg under the infinite group of regular Cremona trans 
formations attached to the set. 

We complete the discussion of sets of 9 points with the Qg associated with 
the set Pg . In S 5 the elliptic norm sextic E 6 has one absolute and 36 projec- 
tive constants; the rational sextic R 6 has three absolute and 38 projective 
constants; and the lf| has 29 projective constants; whence on $9 there is a 
finite number of E 6 s, 2 R 6 s, and o 2 jl/| s. There is, however, in S 5 a new 
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type of rational 2-way of order 4, the Veronese surface V\, which shares 
with M 2 the property that its projection from one of its points is an A/1 . The 
V\ is the map of the plane by the linear system of all conies in the plane. It 
contains &lt;x&gt; 2 conies, the maps of lines of the plane, and the locus of the 2 
planes of these conies is a F| upon which V\ is a double manifold. Analytically 
V\ is obtained by setting a 3-row symmetric determinant of linear forms equal 
to zero and V\ is the locus for which the six first minors vanish. The T \ is 
unaltered by an 8-parameter collineation group, the map of the ternary 
group, whence it has 35 8 = 27 projective constants. We should expect, 
therefore, to find on Qg a finite number of Il s. The surface V\ shares with 
il/| also the property expressed by 

THEOREM 17. The Veronese surface V\ is transformed into a Veronese surface 
V z by a regular Cremona transformation whose F-i&gt;oiiift&lt; an- on V\. If the 
regular transformation in S- a is y ( = l/.r,- (i = , , 5 ) the two I l s arc mapped 
by conies from planes which are in correspondence nmlcr the frn/uri/ &gt;////t/ftr trans 
formation with 6 double F-points. The V\ with double VI is transformed info 
the V* with double V f . 

Indeed the given involution maps the S.j(z/) s upon a system of quintic 
spreads with 4-fold points at the 6 .F-points on V\. This is the map of a 
ternary system of 10-ics with 4-fold points at (} points, which can be trans 
formed by the ternary transformation mentioned into a system of conies. 
The* same involution transforms a cubic spread with nodes at the 6 F-points 
into a similar spread, whence V\ on V\ passes into V on V* . 

Upon V\ there is a linear system of 9 6 s, the maps of cubic curves in the 
plane. Conversely an E 6 on V\ is cut out by a quadric which meets V\ in a 
residual conic, whence the corresponding quartic in the plane breaks up into 
a line and a cubic. Therefore there are no other 6 s on V\ . The conies on 
V\ are trisecant to the E 6 s on V\ . A canonical elliptic parameter on the plane 
cubic is mapped into a canonical parameter on E s whence the planes of 1*4 
are those which meet E 6 in three points for which u\ + u 2 + its = . Ob 
viously any two of these planes lie in an 84 and meet in a point. But the same 
thing is true of the three other involutions for which u-i + 1/2 + "3 = w/2 . 
Hence on E 6 there are 4 V| s or also there are 4 Tf s which contain E s doubled. 
Such a Vl must contain every bisecant of 6 . The locus of bisecants, Bl , of 
E 6 is a 3-way of order 9 which has E 6 as a 4-fold curve, since from a plane E 6 
is projected into a plane sextic with 9 nodes, and from a plane which meets 
6 the E 6 is projected into a plane quintic with 5 nodes. Hence the bisecant 
locus is the complete intersection of two of the four V\ s and the four lie in a 
pencil. A member of this pencil other than a V\ also contains B\. Given 
then a trisecant plane for which Ui + u 2 + w 3 = k , the above pencil of JFfs 
contains the three bisecants in the plane, whence one member, say W\ , contains 
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the plane. Since any plane for which v\ + v* + i s k meets the above 
plane in a point, W\ must meet this latter plane in its bisecants and an outside 
point and therefore must contain it. Hence W\ is the locus of the o 2 tri- 
secant planes v\ + v z + Va = -- k or also of the co 2 trisecant planes u\ + w 2 
+ 3 = k. For each of the 4 Ffs in the pencil of Wfs the two systems of 
generating planes coincide into a single system, since k = k when k = co/2 . 
Hence 

THEOREM 18. An E 6 is contained on 4 V\ s whose Ff s are in the pencil of 
spreads W\ on the bisecant locus Bl of E 6 for which E 6 is a 4-fold curve. A 
particular W\ of the pencil with double E 6 has the two systems of o 2 generating 
trisecant planes for which Ui + u z + 113 = k, k which coincide for the 4 Ffs . 
Under regular Cremona transformation inth F-points on E 6 the properties of this 
pencil are invariant. 

That there are on E 6 four F| s may be seen by the use of an elementary 
theorem. Isolate one of the F| s as the map of a plane. The iTs of the other 
three Ffs cut the isolated one in E 6 doubled, whence in the plane we have the 
square of a cubic expressed in three ways as a symmetric 3-row determinant 
whose elements are conies. But we know that a cubic can be expressed in 
three ways as a symmetric 3-row determinant of linear forms, since it is the 
hessian of three cubics and the square of a symmetric determinant is sym 
metric. Moreover we know that the relation of the hessian to the three 
cubics involves the three half periods. 

THEOREM 19. On a general set Ql there is a unique E 6 and four V%s . 

We see at once that an E 6 and an E G on Ql could not have different absolute 
invariants. For an E & on $9 is projected from a properly chosen trisecant 
plane into an E 3 on the associated Pg, and E 6 into an E s on Pi, whence, 
since E 3 and E cannot coincide, the set Pg is the special set of 9 base points 
of a pencil of 3 s and Qg is also a special set. If, however, there were an E 6 
and an E 6 on Qg , then on projecting from 79 we should have in &lt;S 4 an E 5 and 
E on R% , members of a pencil on an M\ in 84 . Hence in 185 there are o l 
elliptic quintic 2-way cones with vertex at &lt;? 9 and on q\ , , q%, and with no 
other points common to any two. A quadric on Qg and four generators of any 
one of these cones meets the cone in an E 6 on Qg, whence there is a pencil of such 
6 s on Qg with all values of the absolute invariant and again Qg is the special 
set above. This unique E 6 and therefore Ql also carries four F| s. There are 
no F| s on Qg which are not also on E 6 , else there would be on such a V\ an E 
on Q{j. 

THEOREM 20. // PI is the set of base points of a pencil of E s s, its associated 
Qg is the set of base points of a pencil of E 6 s on a V\ , the map of the plane by conies. 

This is an immediate consequence of the elementary theorem that if three 
of the points of such a planar set are on a line the remaining six are on a conic. 

Trans. Am. Math. Soc. 9. 
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We observe that for such a pencil of 6 s on a given V\ each E 6 according to 
Theorem 18 is contained on three other H s whence this special Ql is on &lt;x&gt; 1 
Fa s one of which is isolated while the others divide into triads which depend 
rationally on a parameter. 

If 8 points of such a special Ql are given, the locus of the 9th is a 3-way, four 
of whose points are on any E 6 through the given 8 points. This 3-way is the 
extension of the Weddle surface and bears the same relation to the hyperelliptic 
functions of genus three as the Weddle to those of genus two. This relation 
will be discussed in a forthcoming paper. 

If Pg of Theorem 20 is the set of flex points of an E s , the base points of a 
syzygetic pencil, then any two are on a line with a third, whence 

THEOREM 21. There exists in S 5 a set of 9 points invariant inidn- a Hesse 
collineation Gn& u4th the property that an// lira ]&gt;&lt;&gt;int&gt;t determine u tliinl nuch that 
the remaining six are on an S^. The configuration contains 12 &lt;S 4 s, eight n 
each point. 

This set of 9 points has the unusual property that if six be selected which 
form a reference 6-point, no other one can be taken to be the unit point, since 
each of the other three must lie in one of the reference S 4 s. Using a proper 
set of six as reference points the coordinates of the other three are 

*&gt; 1 *&gt; 

u, co- , 1, or , , co ; 

-1, 1, co 2 , -1, -co 2 , 0; 
1, -1, co, 0, -co, -1 ( = e-- 3 ) . 

The problem of obtaining the four surfaces V\ on a given Qg may be solved 
through the use of the associated set Pi as follows: 

THEOREM 22. On the E s on Pg join the Qth base point of the pencil on -p\ , 
, p$ to p 9 to meet E 3 again in p . From p draw a tangent to E 3 at p" (4 
choices) . Construct a set r 9 , r\, , r 8 congruent to p" , p\, , p s under ,P 
with centers rg , p" . Then conies map the set Rl upon the set Qg associated to Pg 
and map the plane upon one of the four V\ s on Q$ . 

We now consider sets of 9 and of 10 points in &lt;S 5 with reference to the normal 
surfaces M\ and the rational sextic curves R 6 . We have noted that on Qg 
there are &lt; 2 M| s and =o 2 R 6 s. Only co 1 of the M^s contain the unique E 6 
on Qg . For the M\ mapped from the plane by cubic curves on the base O 2 , 
a contains =o 8 E 6 s which are mapped from quartic curves with nodes at 0, 
a whence M\ and E 6 on it have 37 constants. But E 6 alone has 36 constants, 
whence on E & there are o l M^ s. These are the bisecants of the o * involutions 
u + u = k, since lines on cut out such an involution on a ternary quartic 
with node at . An M\ on Qg and not containing E 6 can have no other point 
in common with E 6 . For if E 6 were to meet M\ in 10 points, at least four of 
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the quadrics on M\ would contain E 6 . But four such quadrics meet in a 
residual conic. We now prove 

THEOREM 23. The locus of the o 2 Mi s on Ql is a cubic spread with the E 6 
on Qlfor double curve. A point of this cubic spread forms with Qg a symmetrical 
set Q1 which are the meets of two M\ s and whose associated set Pj is on a quadric 
surface. The cubic spread is that locus of oo 2 trisecant planes of E 6 whose meets 
with E 6 lie with Qg on a quadric. 

For the condition that PI O is on a quadric surface is of degree two in each 
point pi and therefore is a sum of products of 5 four-row determinants. The 
corresponding condition on Qio is a sum of products of 5 six-row determinants 
and therefore is of degree three in each point g,- . Since the condition on P? 
is invariant under regular Cremona transformation this is likewise true of Qi . 
Hence if g 10 is variable the cubic spread must have nodes at Qg. According 
to Theorem 13, Qi is the set of points of intersection of two l/| s on Qg. 
Since the cubic spread contains the so 1 Af^ s on Qg which contain E 6 , it con 
tains the bisecant locus B\ of E 6 and therefore is a member of the pencil of 
Theorem 18 and contains E 6 as a double curve. To prove the trisecant plane 
property we observe (and omit the verification) that if a quadric contains M\, 
a plane on this quadric meets M* in a point. Given then an M\ and a plane 
trisecant to E 6 at v\, Vi, s such that u\ + + "9 + i + Vz + v 3 = 0, of 
the 6 quadrics on M\ and therefore on Qg at least four are on v\ , v z , Vz and at 
least one contains the plane i Vz 03 which therefore meets M\ in a point. 
As M\ varies in the = 2 system on Qg, this point runs over the trisecant plane. 

An jR 6 on Qg is on a unique M\ on Ql and vice versa. For given the M\ 
mapped by cubics on O 2 , a the R 6 s are mapped from ternary quintics with 
4-fold point at and simple point at cr, whence on Qg there is a unique R 6 . 
These R 6 s meet the generators in one point and the directrix conies in four 
points whose four parameters on conic and on R 6 are protective. Given R 6 
on Qg, its quadrisecant planes each carry a unique conic with the projective 
4-point property just mentioned and the locus of these conies is the unique 
M | on Ql and R 6 . 

If in the proof of Theorem 12 the point a is on a ternary quintic which 
maps into an R 6 on $9, then &lt;r coincides with in some direction and the set 
P?o is on a nodal quadric. For such a set the two lf| s coincide. Hence 

THEOREM 24. The two conditions that Ql be on an R 6 are that its associated 
Plo be on a nodal quadric. On such a Qio there is but one M\ . 

The &lt;x&gt; 1 Mfs on Qg which contain the E 6 on Qg are obtained by mapping 
Pg on Qg in the co 1 ways described in 1. All of the &lt;*&gt; 2 Iffs on Qg are ob 
tained by the following construction. 

THEOREM 25. For the set Pi we choose a center p (in 2 ways) and, for 
arbitrarily chosen p w , construct a set ri, , rg , a , congruent to pi, , 
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p 9 , pio, p under J 6 with centers , p. Then cubics on O 2 a map TI, , r g 
upon the set Qg associated to P 9 , and map the plane upon one of the = 2 M 2 s 
on Qjj. 

For if pi, pz, ps are on a line, then r 4 , , r 9 , a , O 2 are on a cubic or &lt;/ 4 , 
, q g are on an &lt;S 4 . That these J/| s are all distinct follows from the fact 
that the &lt;x 2 line pencils from p to Pi are projectively distinct. We observe 
that, when p has been chosen and thereby an M\ isolated, the variation of pw 
implies the variation of the point vO of M\ over the M\ . 

We shall close with an application to the sets of 10 nodes of a rational 
plane sextic and of a symmetroid quartic surface S . These two figures are 
related as follows. The sextic S (t) has a conjugate rational sextic R(t) in 
space such that the plane sections of the one are apolar to the line sections of 
the other. The locus of planes which cut R(t) in catalectic sextics is S (as 
an envelope) and the 10 planes which cut R(t) in cyclic sextics (reducible to 
a sum of two sixth powers) are the ten double planes of S . If such a cyclic 
sextic is (pit) 6 + ( ;&gt; 2 O 6 = 0, then ( pi t ) ( p t) = are the nodal parameters 
of a double point of S ( t ) . Thus the nodes of 2 and the nodes of S ( t) are in 
correspondence. It is known that there are two projectively distinct rational 
sextics S ( , S (T) which determine the same 2 . I have proved but not yet 
published the fact that if 6 nodes of S (t) are on a conic then the complementary 
4 nodes of S are on a plane. Hence conies on the plane map the plane on a 
V\ in SB and the ten nodes of S (t) upon a Qf on V\ which is associated to the 
P?o of nodes of S . But also conies of the plane of S (T ) map this plane on a 
V z and the ten nodes of S (r ) upon the same Qi , since this set also is associated 
to P? From this there follows at once 

THEOREM 26. // two Veronese surfaces V\, V ^ meet in 10 points, Qio, then 
this set is associated to the set Pl of nodes of a Cayley symmetroid. The spreads 
V\ &gt; V i &gt; with double V\ , V respectively, each cut the double spread of the other 
in a 12-ic curve with nodes at Q1 . These curves are the maps from the plane of 
the two rational plane sextics associated ivith the symmetroid. 

UNIVERSITY OF ILLINOIS, 
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ON ALGEBRAIC FUNCTIONS WHICH CAN BE EXPRESSED IN 

TERMS OF RADICALS* 

BY 

J. F. RITT 

I. INTRODUCTION 
We consider, in this paper, an irreducible algebraic relation 

.F(w,z} =0, 

of degree n in w and of genus p , and seek to determine those cases in which w 
can be expressed in terms of z by means of radicals. Why the results obtained 
here should not have been found before this time is a question which has 
puzzled us as much as it will puzzle the reader. 

Our results for the case in which n is prime are fairly complete. W r e show 
first that if n is prime, and if w can be expressed in terms of radicals, then for 
every genus except zero there exists an upper bound for n . For any given genus 
the mechanisms of the possible Riemann surfaces for w can be determined in 
a finite number of steps. 

For the case of genus zero our problem becomes that of finding all rational 
functions whose inverses can be expressed in terms of radicals. Given one 
such function, others can be obtained by performing linear transformations 
upon the function and upon the variable. We show that all rational functions 
of prime degree with inverses expressible in terms of radicals can be thus 
obtained from functions of the following types: 

(a) The powers of w . 

(b) The polynomials which occur in the formulas for the multiplication of the 

argument in the function cos u . 

(c) The fractional rational functions which occur in the formulas for the trans 

formations of the periods of the function &lt;@u . 

(d) For the case of n = 1 (mod 4), the fractional rational functions met, in the 

lemniscatic case (g s = 0) , in the formulas for the multiplication of the 
argument of %) 2 u by a (3i , where a 2 + @? = n 2 . 

(e) For the case of n = 1 (mod 6) , the fractional rational functions met, in the 

equianharmonic case (&lt;fe = 0) , in the formulas for the multiplication of 
the arguments of &lt;@ u and &gt; 2 u by a (3i V3 , where a 2 + 3/3 2 = n 2 . 
The functions just described are already known to have inverses expressible 
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in terms of radicals. We prove here that they are the only such rational 
functions of prime degree. 

What we have done for the case in which n is composite is to determine 
all polynomials whose inverses can be expressed in terms of radicals. In 
stating our result we employ the terms of our paper Prime and composite 
polynomials.* A polynomial F (z) is there called composite if there exist 
two polynomials, &lt;/&gt;i(z) and &lt;fo(z), each of degree greater than unity, such 
that F(z) = &lt;i [ &lt;fo ( z ) ] Otherwise, F ( z ) , if of degree greater than unity, 
is called prime. Let 

F = fa fa &lt;t&gt;r , 

where each 0i(z) is a prime polynomial, and is understood to be substituted 
for z in the polynomial which precedes it. 

We show that if the inverse of F (z) can be expressed in terms of radicals, 
each 0i ( z ) , if not of degree 4 , can be obtained by means of linear transformations 
either from a prime poicer of z or from a trigonometric polynomial of prime degree- 
We will have, that is, if (z) is of degree in ^ 4 , 



where Xi(z) and \2(z) are linear and where either 7r(2) = z m or else 
cos mu = TT ( cos u ) . 

The work for this case, with n composite, consists mainly in the proof of a 
theorem on substitution groups. 

II. FUNCTIONS WITH A PRIME NUMBER OF VALUES 
If the degree n in iv of the irreducible algebraic relation 
(1) F(w,z)=Q 

is prime, and if w can be expressed in terms of radicals, the group of monodromy 
of (1) is either the metacyclic group or one of its transitive subgroups. The 
group of monodromy must contain a substitution of order n. We number 
the branches of w in such a way that the group contains the substitution 
(12 n ) , and represent the metacyclic group with the formula 



(mod,) (J I 0&gt; J % "; ~_ } 



The non-identical substitutions with a = 1 displace every index v . The 
substitutions with a ^ 1 leave a single index fixed. If a substitution of the 
metacyclic group consists of more than one cycle, its cycles are all of the 
same order. 

Suppose that w has q critical points. We consider the elementary sub 
stitutions of the group of monodromy, which correspond to single turns 

* These Transactions, vol. 23 (1922), p. 51. 
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around these critical points. Suppose that there are a of them with a =-- I 
and q a with a ^ 1 . We designate the orders of the latter by Si , s 2 , , 

Sq a . 

If the genus of (1) is p, we have, according to the well known formula of 
Riemann, 



(n- l) a +^-(s i - 1) =2(n - 1) + 2p , 

4=1 Si 



or 



Since no Si is less than 2 , the first member of (3) is not greater than q/2 , 
and we find from (3) 

(4) S4 +^l 

which shows that when p is given, there exists an upper bound for q, inde 
pendent of n . 

We shall prove now that if p is not zero, there exists an upper bound for n 
which depends only on p . 

Suppose first that q &gt; 4. W 7 e have then from (4), 



4p 

n - 1 g - 



If q = 4 , equation (3) gives 



71 -- 1 Si- 

It is seen quickly that the second member of this last equation is at least equal 
to 1/6 . We have thus, in this case, 

n - 1 ^ 12p. 
If q = 3 , equation (3) gives 



n 



The three integers the sum of whose reciprocals is less than unity by as small 
a positive number as possible are 2 , 3 and 7 . We have thus for this case 



n - 1 ^ 

A closer examination of the problem would lead to smaller bounds for n than 
those found above. 

On being given p, we can, with the help of equation (3) and of the upper 

* If q a &lt; 3 , we have a stronger inequality. 
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bounds for n and q, determine the possible Riemann surfaces for w in a finite 
number of steps. We shall not follow this question further in the present paper. 

We consider the case of p = , which includes the most interesting examples 
already known of algebraic functions expressible in terms of radicals. We 
must have q = 2 , 3 or 4 . 

Following our paper referred to in the introduction, we shall call the sum 
of the orders of the branch points of an algebraic function at a given point 
the index of the function at that point. The sum of the indices of the inverse 
of a rational function of degree n is 2n 2 . 

If, when we represent the substitution at a critical point in the form (2), 
the coefficient a belongs to the exponent d modulo n ,. where d &gt; 1 , the sub 
stitution is of order d, and the index at the critical point is ( u -- l)(d -- l)/d. 

We shall consider first those cases in which the Riemann surface for has a 
branch point of order n -- I. Such a branch point must be present at 
infinity if w is the inverse of a polynomial. As the index of w at any critical 
point is at least (n -- l)/2, and as the sum of the indices of w is 2rc 2, 
we must have, in this case, q = 2 or q = 3 . 

If q = 2, w must have two branch points of order n -- I . Subjecting z 
to a suitable linear transformation, we may suppose that one of these points 
is at infinity and the other at zero. The surface thus obtained is recognized 
as that for ic = z lln . The functions uniform on it are rational functions of 
z 1/n . Of these, the only ones which are inverses of polynomials are linear 
integral functions of z 1/re . 

If q = 3, the remaining two critical points of u- , since each has an index 
not less than (n -- l)/2, must each have precisely (n -- 1 )/2 as index. 
Subjecting 2 to a suitable linear transformation, we may suppose that the 
branch point of order n -- 1 is at infinity, and the other two critical points at 
2 = 1 and z = - - 1 respectively. The substitutions at the latter points are 
of the form 

v = ~ v + hi, v = - v + ho (mod n) , 

respectively. The substitution at infinity, the result of following the second 
of these two by the first, is 

v = v + hi - ho (mod n) . 

As this substitution is of period n, we have hi ^ ho. If we renumber the 
branches of w , giving to the branch numbered v the number n determined by 
the congruence 

(5) " = (hi - hz}+ n ~h l (mod??), 

the three elementary substitutions become 

/ = -M, M S-M--I, /z sp + 1 (mod ?i). 
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Thus only one mechanism is possible for the surface of w . Now it is well 
known that the trigonometric polynomial of degree n , / ( w ) , defined by the 
relation 

cos nu = f n ( cos u ) 

has an inverse expressible in terms of radicals, the critical points of the inverse 
being at 1 , 1 , and o . Hence w must be a rational function of f~ l (z) . 

Summarizing the foregoing results, we see that the only polynomials of 
prime degree whose inverses can be expressed in terms of radicals are those of the 
forms a ( w + b ) n + c and af n ( bw + c ) + d , where cos nu = / ( cos u ) . 

We consider now the case of q = 4 . As the index of w at each critical 
point is at least (n -- 1 )/2 and as the sum of the indices is 2n 2 , the index 
of w is precisely (n -- l)/2 at each critical point. The corresponding sub 
stitutions are all of order 2 . Subjecting z to a suitable linear transformation, 
we may throw one of the critical points to o , and so dispose the others that 
the sum of their affixes e\, e 2 , and e 3 is zero. Let the substitutions at e\, e*, 
and e s be respectively 



i/ = v + hi , v = -- v + h z , v = v + h 3 ( mod n ) . 

Suppose that hi and h% are unequal. If we give to the branch of w numbered 
v the number p. determined by (5), the three substitutions become 

// = - M, M = ~ M -- 1. 
( i hi ) JLI = ( h-2 hi ) M + h 3 hi ( mod n ) . 

As h z varies from to n -- 1 , we obtain n types of surfaces which, it will be 
seen below, are all distinct. 

If hi and hi are equal, they must be distinct from h 3 , else the surface would 
not hang together. We can in this case reduce the three substitutions to 

jn = jn, n = n, M M 1 (mod ?0 . 

Thus, the critical points being disposed as described above, there are at 
most n + 1 distinct surfaces for w. To identify these, we construct the 
elliptic function g&gt; O|o&gt;i, co 3 ) , with &?(,-) = et (i = 1 , 2, 3). This is 
possible, since e\ + e 2 + 3 = . Let 

fii 

Q 3 

where ad be = n . It is well known that there are n + 1 distinct trans 
formations (6), and that for every transformation, we have 



where R(w) is a fractional rational function of degree n, whose inverse can 
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be expressed in terms of radicals. The critical points of R~ l ( z ) will correspond 
to those values which g? ( u \ wi , co 3 ) assumes twice at a point, namely, e\ , ez, 
e z , and &lt;x&gt; . It is easy to show that the n + 1 Riemann surfaces for the 
inverses of the functions R ( w ) , occurring in the n + 1 distinct transforma 
tions (6), have distinct mechanisms. The n + 1 surfaces exhibited above 
can be none other than these. 

We pass finally to the case in which q = 3 and in which no branch point of 
order n -- 1 exists. We must have, by (3), 

1 + 1+1 = 1. 

Si 82 S 3 

There are the three possibilities: 

(a) *i = So = j , 5 3 = \ ; 

(b) s 1 = s 2 = s 3 = I ; 

(f) *i = \ , s 2 = i , s 3 = | . 

Consider Case (a). As every Si is a divisor of /; -- 1 , \ve must have n = 1 
(mod 4) . Subjecting z to a suitable transformation, we can place the critical 
points with substitutions of order 4 at and o and the third critical point 
at any point e\. Normalizing the substitutions at the critical points, we find 
that there are not more than two distinct mechanisms for the surface. We 
now take $ u so that &(MI) = e\, g&gt;(aj&gt;) =0, and &gt;(co 3 ) = e\. This 
corresponds to the lemniscatic case. Now, as n = 1 (mod 4) , we have 
11 = o? -)- j3 2 } where a and /3 are integers. In the lemniscatic case, the two 
functions &lt;$? (a. j3i)u are rational functions of $~ u .* The rational func 
tions thus obtained can be inverted in terms of radicals. It is not hard to 
identify their surfaces with those of Case (a). 

Similarly, it is found that Cases (b) and (c) lead to the rational functions 
mentioned in Case (e) of the introduction, which are met in the multiplication 
formulas for the equianharmonic case.f 

III. POLYNOMIALS OF COMPOSITE DEGREE 

We consider a polynomial w = F ( z ) , of prime or composite degree n , 
and seek those cases in which F~ 1 (w), the inverse of F (z ) , can be expressed 
in terms of radicals.! 

* For details relative to complex multiplication in the lemniscatic and equianharmonic 
cases, see Ritt, Periodic functions with a multiplication theorem, these Transactions, 
vol. 23 (1922), p. 16. 

f A detailed discussion of all Riemann surfaces considered in this section is contained in 
a paper by the writer, Permutabk rational functions, now in the hands of the editors of 
these Transactions. 

J We have interchanged the roles of w and 2 in order to conform with the notation of our 
paper referred to above. 
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If F ( z ) is a composite polynomial, that is, if 

^(z) = &lt;M&lt;fc(s)], 

where &lt;i(z) and &lt; 2 (s) are of degrees greater than unity, then if F~ 1 (io) 
can be expressed in terms of radicals, fa 1 (w) and fa 1 (w) can also be so 
expressed, for 

tfr O* ) -^[F-^w)], ^ (w) -- F^i^^w)]. 

We may thus restrict ourselves to the determination of prime polynomials 
whose inverses can be expressed in terms of radicals. 

We will show that if the inverse of a prime polynomial is expressible in 
terms of radicals, the degree of the polynomial, if not equal to four, is a prime 
number. Thus, using the result found for polynomials in the preceding section, 
we will know that if F (z) has the decomposition into prime polynomials 

F = &lt;j&gt;l &lt;f&gt;2 &lt;f&gt; T 

each $&gt;i(z} is either of degree 4, or else is of the form Xi 7rX 2 , where Xi (2) and 
\2 (z) are linear and where TT (2) is either a prime power of z or a trigonometric 
polynomial of prime degree.* 

We refer to II of our paper, Prime and composite polynomials, for a proof 
of the fact that a necessary and sufficient condition that a polynomial be prime 
is that the group of monodromy of its inverse be primitive. 

It is well known that the degree of a primitive solvable group is a power of 
a prime. As the substitution corresponding to the branch point at infinity 
of the inverse of a polynomial of degree n consists of a single cycle of n letters, 
the proof that prime polynomials whose inverses can be expressed in terms of 
radicals are either of prime degree or of degree 4 will be complete as soon as 
we have proved the following theorem on substitution groups: 

THEOREM. A primitive solvable group in p m letters with p prime and m &gt; 1 
cannot contain a substitution of order p m , except in the case of p = 2 , m = 2 . 

Let G be a primitive solvable group of degree p m . Suppose that G contains 
a cyclic subgroup C of order p m . 

It is well known that G contains an invariant transitive abelian subgroup 
F , of order p m , every substitution of which, except identity, is of order p . 
As F is permutable with C , these two groups generate a group H in G , the 
order of which is the product of the orders of F and C , p 2m , divided by the 
order of the group of substitutions common to F and C . The order of H is 
a power of p greater than p m . The substitutions of H are all of the form 
cy where c and j are substitutions of C and F respectively. 

The group C is invariant in a subgroup of H of order greater than p m . 

* Certainly if each tpi (z) is of one of the three types described, F" 1 (w) can be expressed 
in terms of radicals. 
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Hence there must be substitutions of H which are not in C , and with respect 
to which C is invariant. Suppose that cj is such a substitution, where 7 is 
not in C . Then, since 

y- l c~ l Ccy ---- y~ l Cy == C, 

C is invariant with respect to certain substitutions y of F , which are not in C . 
Let 

Cl = ( 1 2 v p m - 1 ) 

be the substitution which generates C. We shall determine the group of 
substitutions which converts c\ into a power of itself. Let a be a substitution 

such that 

a" 1 c i a = c[ , 

where r is any integer not divisible by p . It is evident that a is determined 
as soon as r , and the index s by which a replaces , are given. Consider the 
substitution given analytically by 

(7) v = rv + s ( mod p m ) . 

Noting that Ci has the representation v = v + 1 (mod //") , we see that a" 1 c\ a 
has the representation v = v + r (mod p m ) . Thus a transforms c i into c[ 
and replaces by s . The group in which C is invariant is given by (7), where 
r assumes all values prime to p, and where s is unrestricted. 

We shall now impose the condition that a substitution of the form (7) 
belong to F, but not to C . We have for a p the representation 

v = r p v + s ( r?- 1 + r"-2 + + 1 ) ( mod p m ) . 

As a is not in C , we cannot have r = 1 (mod p m ) . Since a belongs to F, 
a p is identity, so that 

(8) r" = 1 ( mod p m ) , 

(9) s (rv- 1 + r?- 2 + + l)=Q (modp m ). 

By Fermat s theorem, 

(10) r^ = r (modp), 

so that, by (8) and (10), r = 1 (mod p) . Let r = kp + 1 . We have 

r p-* = (p -i)kp + l (mod 



so that 

rP -i + rP -2 + +1 = Z [(P- )*? + !] (modp 2 ), 

i=l 

x&gt; 2 ( p - 1 ) 
^ 
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Suppose that p &gt; 2 . Then p 1 is even and 

(11) rP~ l + r p -* + + 1 = p (mod p 2 ) . 

That is, the first member of (11) is divisible by p, but not by p- . Hence, 
referring to (9), we see that s is divisible by p m ~ l . 
Suppose then that a has the form 

v = (kp + 1 ) v + lp m ~ l (mod p m ) , 

where kp is not divisible by p m . Since F is regular, if we can show that a 
leaves certain indices fixed, we will know that a cannot belong to F . Con 
sider the congruence 

( kp + 1 ) v + lp" 1 - 1 =s v ( mod p m ) , 

or 

(12) kpv + lp" - 1 = (mod ]/") . 

Since kp is not divisible by p m , we see that (12) must have roots. 

We conclude that for a. to belong to F and not to C , we must have p = 2. 
If p = 2 , we must have, by (8), 

r s= 1 (mod 2" )- 

If HI &gt; 2 , this congruence has the four solutions 

r = 1, r == 2 m ~ 1 1 ( mod 2 m ) . 

Suppose that m &gt; 2 , and that r = 2 m ~ l + 1 (mod 2 m ] . The highest power 
of 2 by which r + 1 is divisible is the first. Since, by (9), s ( r + 1 ) is divisible 
by 2 m , we see that s is divisible by 2 m ~ 1 . That is, a. has the form 

= (2- 1 + !) + 2 m ~ l l (mod 2" )- 

Since this substitution leaves the index 0, or the index 1, fixed, according as / 
is even or odd, we cannot have r = 2 m ~ l + 1 (mod 2 m ) , if m &gt; 2 . 

Suppose that m &gt; 2, and that r = 2 m ~ l - 1 (mod 2 m ) . Then, by (9), s 
must be even. Putting v = v , we have 

(2-i - 2)v + 21 = (mod2 m ). 

Now, since m &gt; 2, 2 m ~ 1 2 is divisible by no higher power of 2 than the 
first, so that the congruence above has roots, and a cannot belong to F. 

We consider finally the case of r = - - 1 (mod 2 m ) . We have the 2 m sub 
stitutions 

(13) v = - v + s (mod2 m ), 

which transform C into itself. If s is even, (13) leaves two letters 
fixed and cannot belong to F . Consider those substitutions for which * is 
odd. We say that if F contains one of them, it contains all of them. For if, 
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in the substitutions 



Si, v = - v + s 2 (mod2 m ), 

si and 2 are both odd, the substitution v = v + (s 2 - *i)/2 (mod 2 m ) , which 
belongs to C , transforms the first into the second. As F is an invariant sub 
group, if either of these substitutions belongs to F the other does also. 
Suppose that F contains the two substitutions 

l, v = -v-l (mod2 m ), 



which we denote by i and 2 respectively. As F is abelian, we have, equating 

a.\ oiz and 2 &\ , 

v - 2 = v + 2 ( mod 2 m ) , 

from which it follows that m = 2 . 

In the case of p = 2, m -= 2, the symmetric group in four letters has sub 
stitutions of order 4 . 

The proof of the theorem is completed. 

COLUMBIA UNIVERSITY, 
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ON THE LOCATION OF THE ROOTS OF THE JACOBIAN OF TWO 

BINARY FORMS, AND OF THE DERIVATIVE OF A 

RATIONAL FUNCTION* 

BY 

J. L. WALSH 

1. Introduction. There have recently been published the following results :f 
THEOREM I. // the points zi , z 2 , 23 vary independently and have circular 

regions as their respective loci, then the locus of the point z 4 defined by the real 

constant cross ratio 

A = (Zi , Z 2 , 2s, 24) 

is also a circular region. 

THEOREM II. Let f\ and fa be binary forms of degrees p\ and p 2 respectively, 
and let the circular regions C\, C* , C s be the respective loci of m roots of fa , the 
remaining pi m roots of f\ , and all the roots of fa . Denote by C the circular 
region which is the locus of points z 4 such that 

( Zi , Z 2 , Zs , Z 4 ) = , 

m 

when zi , z 2 , z 3 have the respective loci C\, C 2 , C 3 . Then the locus of the roots 
of the jacobian of fi and fa is composed of the region C$ together icith the regions 
C\, C 2 , C s , except that among the latter the corresponding region is to be omitted J 
if any of the numbers m , pi m , p z is unity. If a region d (i = 1,2, 3, 4) 
has no point in common with any other of those regions which is a part of the 
locus of the roots of the jacobian, it contains precisely m - - 1 , p\ - - m - - 1 , 
p-i 1 , or 1 of those roots according as i = 1 , 2 , 3 , or 4 . 

It is the primary object of the present paper to consider extensions of 
Theorem II in various directions. Chapter I studies the possibility of extend- 

* Presented to the Society, September 9, 1921. 

t These Transactions, vol. 22 (1921), pp. 101-116; this paper will be referred to 
as II. It was preceded by a paper, these Transactions, vol. 19 (1918), pp. 291-298, 
which will be referred to as I, and was followed by a third paper, these Transactions, 
vol. 23 (1922), pp. 67-88, which will be referred to as III. We shall also have occasion to 
refer to two other of our papers, using the letters A and S respectively: Annals of 
Mathematics (2), vol. 22 (1920), pp. 128-144; Comptes Rendus du Congres International 
des Mathematiciens, Strasbourg, 1920, pp. 349-352. The term locus in Theorem I of the 
present paper replaces the term envelope used in II. 

JThe corresponding region is to be omitted in this enumeration of the points of the locus; 
the corresponding region may nevertheless be in whole or in part, a portion of the locus of 
the roots of the jacobian. 

31 
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ing Theorem II to include regions which are not circular. It is found (Theorem 
III) that the reasoning formerly used cannot be directly extended, and specific 
examples bring out the nature of the difficulties in supplying other modes of 
reasoning. Chapter II treats the extension of Theorem II by increasing the 
number of circular regions which are allowed to be loci of roots of the ground 
forms of the jacobian. Theorems VI and XI are fairly general results obtained 
by this extension, the principal results of the paper. Chapter III is a short 
chapter which deals with centers of gravity; the results are mainly generaliza 
tions of well known results for polynomials and their derivatives. Finally, 
Chapter IV deals with the case of the roots of the jacobian of real forms. 
Theorem XV is a rather general result which applies to the roots of the deriva 
tive of a polynomial which has only real roots. 

CHAPTER I: ON THE EXTENSION OF THEOREM II TO OTHER TH*AN CIRCULAR 

REGIONS 

2. A distinctive property of circular regions. We shall now undertake to 
consider the extension of Theorem II to regions which are not circular. Inas 
much as rather large extensions of Theorem I in this direction can be obtained 
without difficulty, and in fact have been obtained in III, our first tendency is 
to attempt to extend Theorem II by repeating our previous reasoning of II, 
p. 113. This turns out to be impossible, due to the failure of Lemma I (II, 
p. 102) to admit of large extension to other than circular regions: 

THEOREM III. // a closed region C has the property that the force at any 
external point P due to every set of k unit particles in C is equivalent to the force 
at P due to k unit particles coinciding at some point of C , then C is a circular 
region. 

In proving this theorem we do not need to assume the property stated for 
every k , but merely for any one particular k (except of course A- = = 1 , for 
which the result is absurd). We shall suppose k = 2 and leave to the reader 
the modifications for the other values of k . 

If C is the whole plane there is no external point P and we may consider the 
theorem true, since C is a circular region. Similarly, if C is a single point the 
theorem is true. In the sequel we assume C to be neither the entire plane 
nor a single point. 

It will be noted that the force at P due to equal particles at two points 
M and N is equivalent to the force at P due to two coincident particles situated 
at Q , the harmonic conjugate of P with respect to M and N * which situation 
is invariant under linear transformation. \Ve shall proceed to prove the 

LEMMA. // P is exterior to C , and M and N are two points of C , then C con 
tains every point on that arc of the circle MNP bounded by M and N which does 
not contain P . 



This fact is quite easy to prove; see for example A, pp. 128-129. 
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Choose P at infinity and equal particles at M and N; the point Q which is 
the mid point of the line segment MN is seen to be in C . Then the mid 
points of the segments MQ , QN are also in C , and in fact we have a set of 
points in C everywhere dense on the segment MN . Hence this entire segment 
belongs to C . 

The region C contains at least two points. It follows from the lemma that 
C contains an infinity of points. Transform one point R of the boundary of C 
to infinity, and consider two other distinct points V and W of that boundary. 
We proceed to prove that every point of the segment YW is a point of C. 
We can find a sequence of points not belonging to C but approaching R . 
Hence there is a sequence of points (the harmonic conjugates of the former 
sequence with respect to Y and W} belonging to C and approaching the mid 
point U of YW , so U belongs to C . Then the mid points of UV and YW 
also belong to C , and in this way we prove that every point of YW belongs 
to C . But R also belongs to C , and hence we can prove that either every 
point of that infinite segment of R] r which does not contain W belongs to C 
or every point of that infinite segment RW which does not contain Y belongs 
to C; for definiteness suppose the latter. 

There exists an infinite sequence of points not belonging to C but approach 
ing W , and we assume as of course we may do that we have oriented the line 
I ll horizontally and that we have an infinite sequence of such points [Z k ] 
all lying in the lower half plane. If a line through Z k cuts the segment YWR, 
then every point of that line which lies in the upper half plane belongs to C , 
by the lemma. For any preassigned point Y in the upper half plane we can 
choose a point Z k such that YZ k cuts the segment YWR, so every point of 
the upper half plane belongs to C as does also every point of the line YW . 

If the region C does not consist of precisely the upper half plane including 
its boundary, there is a point A" of C in the lower half plane. We can make 
use of the fact that R is a point of the boundary of C as before, and prove that 
the entire finite segment joining A" to an arbitrary point of the line YW belongs 
to C . Then Y and W are not boundary points of C , contrary to our assump 
tion. The demonstration of the theorem is now complete. 

Theorem III is quite easily proved, although by essentially the same 
methods, if we assume C to be bounded by a regular curve. Thus the property 
considered is invariant under linear transformation, for the position of the 
A- coincident particles is uniquely determined by P and the original particles, 
and by Theorem I (I, p. 291 = II, p. 101) P is a root of the jacobian of the two 
binary forms each of degree k and whose roots are respectively the k original 
particles in C and the k coincident particles. If C is not bounded by a circle, its 
boundary can be transformed into a contour which is not convex.* Then 

* See Theorem III of a note by the present writer, Annals of Mathematics (2), 
vol. 22 (1921), pp. 262-266. 

Trans. Am. Math. Soc. 3. 
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there are two points A and B on the boundary of C such that the segment 
AB is exterior to C and such that there is a point P exterior to C and on the 
line AB but not on the segment AB. The force at P due to one particle at 
A and k -- 1 particles at B is equal to the force at P due to k coincident par 
ticles at a point which is on the segment AB but which coincides with neither 
A nor B and hence which is exterior to C . 

The essence of Lemma I (II, p. 102) was proved and applied by Laguerre;* 
his formulation of the result was quite different from the present formulation, 
although the application was to the location of the roots of algebraic equations. 
Thus Theorem III appears as a sort of converse of the theorem of Laguerre, 
as well as of Lemma I (II, p. 102). 

3. Successive application of Theorem II to the determination of loci. The 
property of circular regions stated in Theorem III seems to be conclusive 
in showing that the reasoning of II, p. 113, cannot be reproduced to give 
large extensions of Theorem II. Theorem III justifies, moreover, the some 
what artificial use of circular regions in III, Theorem XIII. 

\Ye now point out by two simple examples how results can be found from 
successive applications of Theorem II. These examples are not given as 
large extensions of Theorem II, but rather to show how difficult is that 
extension to regions which are not circular. The proofs of these theorems are 
left to the reader; the proof of the latter depends on III, Theorem IX. 

THEOREM IV. Suppose we have a finite or infinite mtmber of sets of regions 
C ( i n) , C" 2 n) , C ( s } , C ( 4 B) , of Theorem I corresponding to the value X = pi/m, and 
suppose that no C"i" has a point in common with C ( f } unless i = j (i , j = 1, 
2,3,4). Denote by TI, TV. TV TV the regions common to all the Cf, CT, 
C a\ C ( 4 n) , respectively. Then if T\ contains m roots of a bilinear form f\ , if T 
contains all the remaining pi m roots of f\, and if T 3 contains all the p 2 roots 
of a second form / 2 , then the regions T\, TV TV T \ contain all the roots of the 
jacobian offi and fa . No two of the regions TI , T , T 3 , T have a point in com 
mon, and they contain respectively m -- 1 , p\ m 1 , p 2 -- 1 , 1 of the roots 
of the jacobian. 

There is no reason to suppose that the actual locus of the roots of the 
jacobian is composed of TI , TV TV T , when TI, T 2 , T z are the loci of roots 
of the ground forms. But if the regions C ( ?\ 0? , Cf, Cf have the dis 
position suggested in the first part of 11 (III) or more generally if T 4 is the 
locus of the point 24 determined by its cross ratio pi/m with the points Zi, 
z 2 , 2 3 whose loci are T\ , TI, TV these four regions form that locus, except 
of course that among these latter three the corresponding region is to be 
omitted if any of the numbers m , pi m , p z is unity. 
*CEuvres, pp. 56-63; p. 59. 
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THEOREM V. In the situation of III, Theorem VIII, suppose the point P 
which is a center of external similitude for every pair of the circles Ci, Co, C 
to be actually external to all those circles. Denote by 7\ , T, T the portions of 
the interiors of those circles which lie between two half litres through P cutting 
those circles. Then if T\ and TI are the respective loci of mi and m 2 roots of a 
polynomial f ( z ) , the regions TI, T, T are the loci of the roots of its derivative 
/ (z) , except that 7\ or T 2 is to be omitted if TOI or m- 2 is unity. If TI, T 2 , T 
are mutually external, thei/ contain respectively mi -- 1, m- 2 1 , 1 of the roots 
off (z). 

A theorem similar to Theorem V will be obtained by cutting the circles 
Ci, C-i, C by any convex contour, but no result can generally be stated in this 
case concerning the actual locus of the roots of / ( z ) . 

In discussing the possibility of the extension of Theorem II by reproducing 
the reasoning of II, p. 113, we reached the impossibility of extending Lemma 
I (II, p. 102), by which we may replace the force at an arbitrary point P 
exterior to a region C due to A: particles in C by the force at P due to k co 
incident particles in C. To obtain certain facts, however, concerning the 
location of the roots of the jacobian, it may not be necessary to replace the k 
particles in C by k coincident particles in C for an arbitrary point P exterior 
to C but merely for certain points P exterior to C . This fact is notably true 
if the ground forms are real, as we shall show in Chapter IV; it is also true for 
certain other cases, as we shall now indicate by an extremely simple example. 

We consider the polynomial 

/(z) = z(z - ai) (z - a 2 ), 

where ai and aj 2 have as their common locus the interior and boundary of the 
circle C i whose center is z = 6 and radius unity. Then/ (z) has a root zi 
which has as its locus the interior and boundary of the circle Ci and a root z* 
which has as its locus the interior and boundary of the circle C whose center is 
z = 2 and radius 1/3. Under the given conditions, moreover, zi and z 2 
remain separate and distinct. 

Let us now T consider the same polynomial but assign to the roots ai and a 2 
as their common locus the right-hand semicircular region S i of C\ . Any point 
of the right-hand semicircular region S of C is by III, Theorem IX, a point 
of the locus of z 2 , and we shall prove that no other point is a point of this locus. 
Suppose a point 2 to be a point of the locus of z 2 ; z is evidently in or on C . 
The force at z due to particles at i and o? 2 is equivalent to the force at 2 
due to two particles coinciding at some point a. We know that z is in or 
on C , hence exterior to C\, so a is in or on C i . Moreover, 2 is not in the 
half plane bounded by and lying to the right of the line x = 6 , and hence a 
is in that half plane. Then a is in S( , so 2 is in S . 
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The locus of Zi under the conditions stated includes of course S( , but other 
points as well. In fact, if we choose ai = 6 + i, 0.1 = 6 i, the two roots 
of the real polynomial / (2) cannot be conjugate imaginary, so Zi is real but 
to the left of the point 2 = 6 and therefore not a point of S{ . 

^Ye consider anew the same polynomial and assign the left-hand semicircle 
S i of Ci as the locus of cxi and a 2 Any point of S" is evidently a point of 
the locus of 3j , and no other point belongs to this locus. For any such point 
would lie in the semicircle S , which is impossible, according to the theorem 
of Lucas. Under our conditions the locus of z 2 includes the left-hand semi 
circle S" of C , but also other points. We choose as before ot\ = 6 + i . , 
2 = 6 i , and find z? to be real and on or within S as previously noted. 
But Zo T* 2, and hence is not a point of S" . 

Results which are large extensions of Theorem II to other than circular 
regions seem difficult to prove, as is shown by Theorems IV and V, even when 
the regions involved are common to two or more circular regions. But 
theorems of a certain type are easily established ; we give simply one example: 

Let the intersecting circles C\ and fo trith centers at the points a\ and 2 
and radii r\ and r 2 be the respective loci of m\ and ni-&gt; roots of a polynomial f (z) . 
Let the region common to C\ and C\ be the locus off (z) , irhich is suppofteil 1&lt;&gt; 
have no roots other than those mentioned. Then the roots of f (z) lie in C\ , C 2 , 
and the region common to the two circles whose centers are the 

\ -f- 



mi + &gt; + m 3 
and whose radii are respectively 



The region mentioned in this theorem is not the locus of the roots of / (-) , 
for the intersection of the last two circles mentioned in the theorem can never 
be a root of / (z) . The proof of these facts is left to the reader. 

4. The number of roots of the jacobian in a circular region. A question 
concerning the distribution of the roots of the jacobian which is very closely 
connected with the question of extending Theorem II to regions other than 
circular is that of the number of roots of the jacobian in the regions d when 
two or more of those regions have common points. Thus it might be supposed 
that Ci , C 2 , C 3 , C 4 contain always m , p l m , p z , 1 of the roots of the jacobian 
except for the possibility that this number be exceeded when Ci has one or 
more points in common with another of the regions. This supposition is false, 
however, as we proceed to show r by an example. Thus consider the circles 
C\ and C z in the form of Theorem II corresponding to III, Theorem VIII 
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(i.e., S, Theorem I), such that C intersects P 2 but is exterior to d . Let the 
line of centers of the circles intersect PI in AI (the intersection nearest C 
and P 2 ) and intersect C z in A z and C in A (the intersections farthest from 
PI). When mi roots of /(z) coincide at ^ and m z roots at ^4 2 , we have 
TOI -- 1 roots of/ (2) at ^4i, ??; 2 -- 1 roots at A z , and one root at A. When 
the m z roots at A z move slightly so that they do not all coincide but all remain 
on P 2 and symmetric with respect to the line A\ A z , the force corresponding 
at A and in the neighborhood of A becomes equivalent to the force due to m z 
particles coinciding exterior to C z , since these particles can be considered to 
lie in the circular region consisting of the circle C z and all exterior points. 
Hence the root of / (s) at A moves and becomes exterior to C; the m z roots 
at A z remain in the vicinity of A&gt; 2 and there is no root of / (2) on or interior 
to C. 

The question which we raised has thus been answered so far as con 
cerns a region C which contains no roots of the ground forms. The result is 
essentially the same for a region which does contain a number of roots of the 
ground forms. Consider the case of the derivative of a polynomial, the second 
theorem on page 115 of II, locate m\. roots of/(z) at the null circle C\, and 
locate the remaining m z roots at two points A and B different from PI . There 
are but two roots z\ and z 2 of / ( z ) distinct from A , B , and PI , and these are 
interior to the triangle ABCi, so a circle P 2 can be drawn which includes A 
and B (that is, w 2 roots of /(z)) but includes neither 2! nor z 2 and hence 
contains only m z 2 roots of / (z) . 

The question we have been considering is closely connected with the follow 
ing:* Suppose a circle C contains at least r roots of a polynomial / (z) of 
degree n . What can be said of the number of roots of / ( z ) in P ? 

On the one hand, P may contain all the roots of /(z) and hence all the 
n -- 1 roots of / (z) . On the other hand, P may contain r roots of /(z) and 
yet no root of / ( z ) if merely r &lt; n . In fact, we prove that P may contain 
precisely n -- 1 roots of /(z) and contain a preassigned number p of the 
roots of / (z), if merely p &lt; n -- 1 . Locate one root of /(z) at a point P 
and the other n -- 1 roots at n p I distinct points which lie on a line L 
not passing through P. Then we can describe a circle P which includes the 
n p I distinct points on L and hence p roots of / ( z ) but which contains 
no other roots of / ( z ) . 

* Still another allied question is: Suppose a circle C is known to contain at least r roots 
of a polynomial of degree n; determine the smallest (concentric) circle C which always con 
tains at least m roots of the derived polynomial. 

The circle C exists only if m &lt; r . For n = r = m + 1 , the answer is given by Lucas s 
Theorem. For the case r = 2 , the circle C is readily determined by means of a theorem 
due to Grace, to which reference is made in A, 4. For the case n 1 = r = TO + 1 , 
the circle C is easily found by the second theorem of II, p. 115. For other cases the problem 
seems considerably more complicated. 
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CHAPTER II: ON THE EXTENSION OF THEOREM II TO A LARGER NUMBER OF 

CIRCULAR REGIONS 

5. Problem of the locus corresponding to any number of circular regions. 
Our attempt in Chapter I to extend Theorem II in a form to apply to the 
jacobian of two particular binary forms by considering regions other than 
circular as loci of the roots of the ground forms and finding the corresponding 
locus of the roots of the jacobian was not particularly fruitful. This seems to 
result rather from the nature of the problem itself than from the precise 
methods employed. We now take up the possibility of extending Theorem 
II so as to consider not merely three circular regions but any number of 
circular regions. Let us suppose explicitly that we have the binary forms f\ 
and /2 of respective degrees p\ and p , and that the circular regions C( , C 2 , 

, C m are the loci respectively of p[ , p 2 , , \&gt; m roots of /i and the circular 
regions C" , ( ", , C" are the loci respectively of p [, pi, , pi roots of / 2 , 
where we have 

Pi + v -i + + p m = Pi , 

Pi + P2+ + Pn = P 2 

For convenience in phraseology, we shall suppose that none of these regions 
is either a point or the entire plane unless otherwise stated. We wish then to 
find the location of the roots of the jacobian of /i and/o; not merely to deter 
mine certain regions in which lie or do not lie the roots of the jacobian, but to 
determine the actual locus of those roots under the assigned conditions, as in 
Theorem II. 

Let us consider, for any particular values of the roots of /i and / 2 which 
satisfy our hypothesis, a root f of the jacobian exterior to all the circular 
regions C( , , C m , C" , , C" . This root f is an analytic function of a , 
any root of /i or/ 2 , and hence when a varies over a certain two-dimensional 
continuum, f also varies over a certain two-dimensional continuum. We 
thus have a certain number of regions which may or may not be distinct and 
may or may not have common points which are the loci of the points f . 
We see by the analyticity of the transformation that all the points a. must 
be on the boundaries o their proper regions whenever a point f corresponding 
is on the boundary of its locus.* Moreover, if a point " is on the boundary 
of its locus, and exterior to all the regions C( , , C m , C" , , C" n , we know 
by Lemma I (II, p. 102) that all the points a pertaining to any one circular 
region can be considered to coincide on the boundary of that region. But 
the precise manner of simultaneous variation of these coincident roots on the 

* There is an exception to this reasoning if the algebraic equation defining f degenerates 
and if f is independent of a particular a, but in that case a can be moved at will without 
changing f and so a can be considered as on the boundary of its locus. A similar remark 
applies also below. 
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boundaries of their loci in such a manner that a point f or several points f 
remain on the boundaries of their loci and trace out those boundaries is as 
yet unknown. 

Let us restrict ourselves for the moment to the situation where the circular 
regions C{, , C" n are relatively small, or to be more precise, such that for 
no choice of the roots of the ground forms in their proper regions can two 
roots of the jacobian coalesce exterior to those circular regions; we suppose 
further that no two of the regions C{, , C" and the regions R which are the 
loci each of one of the roots of the jacobian exterior to those circular regions 
when the roots of the ground forms have their proper regions as loci no two of all 
these regions have a point in common. We may allow the roots of the ground 
forms to coalesce in their proper regions; we notice that the circular regions 
C(, C 2 , , C m , C i, C", , C n contain and are therefore the loci of respec 
tively 7/1 -- 1 , p 2 - 1 , , p m -- 1 , 7/1 -- 1 , 7/2 - 1 , , // -- 1 roots of 
the jacobian. There are then m -\-n~2 regions R each of which is the locus 
of one root of the jacobian. When we allow the circular regions to become 
larger and larger, of course the regions R expand also, need not preserve their 
identity (for example, two of them may coincide), and finally these regions 
cover the entire plane. 

Very little is known of the precise nature of the boundaries of these regions 
R .* Their boundaries are not, except in very special instances, circular 
regions, but are curves which presumably have interesting properties with 
reference to the boundaries of the regions C( , , which properties can 
be expressed in a manner so as to be invariant under linear transformation. 
It is evidently true that if we start with any situation C(, , C" n and if we 
allow two of the regions ( {, , C m or two of the regions C (, , C" to co 
alesce, one of the regions R will coalesce with them, and we shall have pre 
cisely the situation of m -- 1 regions C { or n -- 1 regions C", . 

The exact determination of the regions R in any very general case seems 
difficult. If all the original circular regions reduce to points except one of 
them, say C( , we can determine the path of the roots f of the jacobian as a , 
a p(-fo\d root of /i , traces the circle C\ . These roots " , in their totality, trace 
closed curves, for the situation when a returns to its initial position is exactly 
the same as the initial situation. The boundaries of the regions R must be 
composed of these closed curves, or at least of portions of them. If now we 
allow a second one of our circular regions, say C i, to be a non-degenerate 
region, the new locus of the roots of the jacobian will be a number of regions 

* The writer conjectures that when there are q roots of the jacobian in these regions R, 
these regions are in their totality bounded by a degenerate or non-degenerate ^-circular 2g-ic ; 
only the degenerate cases of this curve have ever been treated in detail, except for 9 = 1. 
Compare Walsh, Proceedings of the National Academy of Sciences, vol. 
8 (1922), pp. 139-141. 
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R . The boundaries of the regions R will be curves which are envelopes of 
the curves R corresponding to the region C( and the null regions C" 2 , , 
C m ,Ci=P, f L ,C n, while the point $ traces the circle C [. By con 
tinuing in this way, we have a process for the generation of the regions R in 
any case desired. But the actual determination of the boundaries in a very 
general case would presumably be too laborious by this process; more powerful 
methods will have to be devised. 

The statement has been made that the regions R are not in general circular 
regions; it is perhaps worth while to present a specific instance to illustrate 
this fact. We consider the polynomial 

/(z) == (z - a,)(z -!)(-,), 

where ai = i , a = - i, and the locus of a 3 is chosen to be a circle C$ and its 
exterior, whose center is the origin and radius so large that the loci of the two 
roofs of / (z) are entirely distinct. In the field of force to determine the 
roots of / (z), the force at a point of either coordinate axis due to the two 
particles at i and -- I is in direction along that axis. Hence, whenever/ (2) 
has a root on an axis, a s must also be on that axis. The root J" of/ (z) larger 
in absolute value is determined on the positive half of the axis of reals by the 
particle as at the right-hand intersection of C 3 and that axis, and a second 
particle of twice the mass at the harmonic conjugate* of " with respect to 
the points i and - i; this harmonic conjugate lies to the left of the origin. 
This root f is determined on the positive half of the axis of imaginaries by a 
at the upper intersection of C 3 and that axis, and a second particle of twice 
the mass at the harmonic conjugate of f with respect to i and i; this 
harmonic conjugate lies above the origin. The curve bounding the locus of 
f is symmetric with respect to the coordinate axes and hence is not a circle. 

The characteristic of Theorem II (and indeed also of Theorems VI and XI) 
in comparison with the more general results indicated in this present section 
seems to be a certain linearity. This fact is brought out very clearly in S, 
but also in II, since by Lemma II (II, p. 102) the position of equilibrium is 
determined by its cross ratio with three points, a relation which is essentially 
linear. It is as a result of that linearity that for our particular situations the 
loci of the roots of the jacobian are all bounded by circles. 

6. A condition that a root of the jacobian be on the boundary of its locus. 
We return now to the general case of the preceding section, and shall obtain 
a geometric relation between the roots of the ground forms and a root of the 
jacobian, when all of those roots are on the boundaries of their loci. 

Consider the points i and a^ at which coincide all the p{ roots in C( and all 
the p 2 roots in C 2 respectively; we consider also a root f of the jacobian which 

* Compare 2, 9; also A, pp. 128-129. 
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is supposed not to lie at a common root of the two ground forms or at a multiple 
root of either form, so that f is given by a certain algebraic equation which 
actually contains i and 2 We may write this equation in the form 

(1) fr . _P_ 



. 

f Oil f Oil 

We can hold fast f and all roots of the ground forms other than a\ and a 2 , 
and move a.\ and a 2 depending on each other so as to satisfy (1). Since (1) 
is linear in OL\ and a 2 , when one of these points is made to trace a circle the 
other also traces a circle. When a\ moves so as to trace C[ , a 2 moves so as to 
trace a circle tangent to C 2 . In fact, if a 2 were to trace a circle intersecting 
C 2 , &lt;x 2 would at some time move interior to the region C 2 , and still i and a 2 
would be in their proper loci. Then motion of a 2 holding ai fast would cause 
" to move over a two-dimensional continuum, so " would not be on the bound 
ary of its locus. 

It will be useful to study in some detail the relation between ai and a 2 
defined by (1). The two double points of the transformation (ai, a 2 ) are 

, Pi + P2 
a\ = 2 = f ; i == 2 == &lt;T + T 5 

denote this latter point by a . We have, of course, p{ + p 2 7* , so that these 
two points are distinct. The cross ratio of these fixed points with a\ and a 2 in 
any position is readily computed: 



- on] 



[&lt;- 



[ 2 -n[( 



r + ^^- 2 - 8 



= PJ + P2 . 



Inasmuch as this cross ratio is real, the four points a\ , a 2 , f , a lie on a circle C . 

The circle C is self-corresponding under the transformation (a\, 2 ). 
In fact, two of its points f and a are unchanged while a third point on is 
transformed into a point 2 of the circle; this is sufficient. 

If p( and p 2 are both negative instead of both positive, we have the case 
of the roots of / 2 located in C [ and C 2 . In either of these situations, the 
first case we consider, the points i and a 2 are separated by f and a . For 
a transformation can be made so that k = . The value of the cross ratio 
gives us 

ai - oii = P I + P2 i t = Pi . 
f a 2 p 2 f - 2 p * 

so ai and a 2 are indeed separated by f and a . 

We thus choose f as fixed on the boundary of its locus; the fixed points 
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ai and a 2 (particular values of a! and a 2 respectively) corresponding are also 
on the boundaries of their respective loci. We have already remarked that 
when ai moves from ai along C( , 2 moves from a 2 along a circle tangent to 
C 2 . When ai moves from ai interior to the region C\ , a 2 moves from a 2 
exterior to the region C 2 . When a! moves on C , a 2 moves in the opposite 
direction but also on C . It follows that C cuts C{ and C 2 at angles of the same 
magnitude, and when C is transformed into a straight line the tangents to C( 
at ai and C 2 at a 2 an- ]nirn//e 1 .* There are different possibilities here according 
to whether C cuts C( and C 2 at equal angles or at supplementary angles. We 
leave it for the reader to notice that if the loci C( and C 2 are both interior or 
both exterior to their bounding circles, these two angles are equal; if one 
locus is interior and the other exterior to its bounding circle, these two angles 
are supplementary. 

The second case we shall consider is that of two roots ai and a 2 , of the forms 
/i and/2, of multiplicities -p( and ;/{ , and loci C( and C", respectively. 
Essentially the same formulas apply, except that in (1) and the succeeding 
formulas the numbers p{ and p 2 are replaced by p 2 p{ and - p\ p [ respec 
tively; we suppose pzp i l&gt;\ l&gt; [ ^ 0. The cross ratio of the four points 
ai , a 2 , " , a shows that ai and a 2 are not separated by f or a . Hence if 
ai and a 2 trace the self-corresponding circle C , they trace it in the same sense. 
The angles which C cuts on C{ and C [ are then equal or supplementary 
according as the regions C ( and C [ lie one inside and the other outside their 
bounding circles, or as these regions lie both inside or both outside their 
bounding circles. When C is transformed into a straight line, the lines tangent 
to C\ at i and to C [ at 2 are parallel. 

The third case we have to treat is the remaining situation under the second 
case, where p z p{ pi pi == 0; here the transformation (ai, a 2 ) has but 
the one double point f . It is still true that the circle C through ai , a 2 , f 
is self-corresponding under this transformation. For if we denote by 7 the 
point a 2 corresponding to ai == a 2 , where ai and a 2 are fixed values of ai 
and a 2 , we shall have 

_ k = PiP 1 - _?L?L = P-P I . PPL 



so ai, a 2 , f, 7 are concyclic; the three points ai, a 2 , f of C are transformed 

* We cannot prove here, as in III, Theorem II, that this property holds also for the tangent 
to the boundary of the locus of f at the point f . In fact, if we choose another pair of 
points a( , &lt;4 , leading to the circle C , it is in general impossible for C to cut at equal angles 
C( at a( and the boundary of the locus of f at . For a specific example, see the illustration 
used at the close of 5. 
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into three points a 2 , 7, f of C which, therefore, is self-corresponding. If f is 
transformed to infinity, we have 



2 7 

so 7 is obtained from a 2 by translation by an amount equal to a[ a 2 . 
Then a\ and a* trace C in the same sense. As in our second case, the angles 
which C cuts on C{ and C" are equal or supplementary according as the regions 
C{ and C [ lie one inside and the other outside their bounding circles, or as 
these regions lie both inside or both outside their bounding circles. When 
C is transformed into a straight line, the lines tangent to C\ at on and to C [ 
at ao are parallel.* 

We have now considered all typical cases of two roots a\ and 2 of the 
ground forms. In particular if we choose two roots of a single form which 
have the same locus, the reasoning we have used shows that when f is on the 
boundary of its locus a\ and a 2 must be on the boundary of their common 
locus and must coincide. This may be used to replace Lemma I (II, p. 102). 
The reader may be interested in applying the remarks bf the present section 
to the situations of Theorems II and VI. 

7. A special case of coaxial circles. We have pointed out in 5 that for 
the situation there described of m + n circular regions C i, , C m , C" , , 
C" as the loci of the roots of the ground forms, the locus of the roots of the 
jacobian is not in general a number of circular regions or of regions bounded 
by several circles. But of course there are special situations for which the 
locus of the roots of the jacobian is bounded by circles. This is evidently 
true, for example, if C(, , C m , C [, , C" n are bounded by m + n con 
centric circles or more generally by m + /i coaxial circles having no common 
point. f For such a situation, moreover, the methods used in I can be applied; 
we shall give an illustration of that fact. 

We use the notation of I, p. 293 ff., and suppose the situation simplified 

* This remark enables us immediately to state something about the locus to be determined 
in certain cases. Thus consider the situation and notation of Theorem X (we might indeed 
choose that of Theorem VI). Choose the line through i , , horizontal and draw a 
parallel L through any point 2 on the boundary of its locus. If one point fi (notation of 9) 
is not on L , say below L , and if z is exterior to all the circles Ci , , C H , all other 
points f i are also below L . Then by Lucas s Theorem, z cannot be a root of / ( z ) . 
Therefore all points f ; lie on L and z lies on a circle C . 

If nil , , m n are all greater than unity, no point z interior to a circle C, need be con 
sidered. If any nit is unity, however, the points z interior to d must be considered. The 
writer has been unable to treat similarly (by the method just indicated) this last case, and 
thus completely to prove Theorem X. 

t If the coaxial system is composed of all circles through two points or all circles tangent 
at a single point, we may consider all the roots of both forms to coincide at a single point, the 
jacobian vanishes identically, and the locus of the roots of the jacobian is the entire plane. 
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by transformation as in I. Suppose Ci to contain A- roots of /i (pz k positive 
particles) and I roots of / 2 (pi / negative particles). If / is sufficiently small 
in comparison with /, , and if C 2 and C 3 are sufficiently remote, it seems reason 
able to suppose that we can obtain a region near but exterior to C\ , which 
region contains no root of the jacobian. The circle C\ contains, then, k 
particles each of mass p* and / particles each of mass -- p\. Outside of C 2 
there are p\ k particles each of mass pz and outside of 3 there are pz I 
particles each of mass pi . If Q is a position of equilibrium, we must have, 
in the notation of I, 

V- k &lt; J?lL 4. P- ( P I " * ) 4. ? J ( ^ 2 Ilil , 
a + r~ r a b r c 4- r 

which can be put into the equivalent form 



(2) + r[;; 2 A-(a + M( - c) + pil(a + c)(b -a)] 

+ [ ~ Pi lh a " ( b + c ) + PZ kac ( a + 6 ) + pi Jafe ( a + c ) ] . 

This form does not simplify materially. Denote by C 4 and C 5 the circles 
whose centers are and radii the roots of the right-hand member. The cross 
ratio of the points Ci, C 4 . C s , Cl (notation as in I, p. 294) with the collinear 
points C", C - 2 , C 3 can easily be calculated, but this cross ratio contains a, 
b , c explicitly and is not independent of their ratios; we therefore use a different 
method to describe C 4 and C\ . We are supposing implicitly that the roots 
of the right-hand member of (2) are positive or that at least one of these roots 
is positive. 

If C 4 and C s lie between Ci and C 2 and between Ci and C 3 , they bound an 
annulus which contains no root of the jacobian. For if r = a , the right-hand 
member of (2) reduces to 

2pil(a +c)(b -a), 

so that inequality is satisfied for r --= a and therefore is not satisfied when r 
lies between the two roots. 

Under this hypothesis we can determine the precise number of roots of the 
jacobian in the smaller of the new circles by allowing the roots of /i and / 2 
in Ci to move continuously and to coincide at . When the p 2 k and &gt;i I 
particles are all in coincidence at , the circle C\ contains precisely k -f / 1 
roots of the jacobian, so this is the original number of roots interior to or on 
the inner boundary of the annulus. 

Hence we have, under the assumptions already made: 

1. // the circles 4 and C 5 lie between C\ and C 3 , then the annular region 
between C 4 and C 6 contains no root of the jacobian of fi andf 2 . The region which 
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is bounded by f 4 or $ and contains the region Ci contains precisely k + I -- 1 
roots of the jacobian. 

2. If T 4 and C 5 are separated by C 3 , there are no roots of the jacobian in the 
annular region u hich is part of the annular region bounded by C and C$ and 
which contains no point of the region C 3 . The circular region bounded by Ci 
or 5 which contains the region d but no point of the region C s contains precisely 
k + I -- 1 roots of the jacobian. 

This theorem can readily be expressed in general form so as to include the 
situation after linear transformation; compare the corresponding statement 
in I. 

8. Theorem VI, a general theorem for circles having an external center of 
similitude. There is another fairly general class of loci other than the very 
simple class just considered for which the locus of the roots of the jacobian 
as treated in 5 is bounded by circles. We shall now use a method which 
is novel in some respects but which makes use of our former results to establish 

THEOREM VI. Let the interiors and boundaries of the circles C{ , C 2 , , C n &gt; , 
u hose centers arc a{, a 2 , , a n &gt; , respectively, be the loci of m i, m 2 , , m n &gt; 
roots of the form f i which has no other roots. Let the interiors and boundaries of 
the circles C [, Cl, , C",,, whose centers are a" , a" 2 , , , respectively, 
be the loci of m [, m", , m" n ,, roots of the form f 2 which has no other roots. 
Suppose further that a point P is interior to no circle C\ or C" and is an external 
center of similitude for every pair of the circles (? , C - and for every pair of the 
circles , C" and an internal center of _similitvde for every pair C\, C" . The 
distinct roots of ./ , the jacobian offi andf 2 , when all the roots of the ground forms 
are concentrated at the centers of their proper circles, are denoted by OL\, a*, , 
a n (all of which points are collinear with P , o&gt;{ , , a n , , d{ , , /) of multi 
plicities mi, m 2 , , m n , and by C\, C 2 , , C n are denoted the circles which 
have these points as centers and radii such that P is an external or internal center 
of similitude for every pair of the circles C(, , C n &gt; , C [, , C" n &gt;, , C\, , C n - 
Then the locus of the roots of J , the jacobian of fi and / 2 , is composed of the 
interiors and boundaries of the circles Ci , C 2 , , C n . A circle C, exterior to 
all the other circles Cj contains precisely nii roots of J . 

Limiting cases of the circles C{ , , C n , , C", , C" are the points P 
and P , the point at infinity. We shall admit these circles as possibilities in 
the demonstration of the theorem, providing, however, that there is at least 
one proper circle, which we shall suppose to be C( . If there is no proper circle 
C( or C i, either the only roots of J are P and P , in which case the theorem 
remains true, or every point of the plane is a root of J, in which case the 
theorem is not true. 

The configuration of the three sets of circles has some obvious but interesting 
properties relative to J . Let us choose as horizontal the line L on which lie 
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P, a( , ,a n ,,cl(, , oil,,, with the C t to the right of P, let us number in 
order the two sets of circles commencing with C\ and C [, the nearest circles 
to P , and let us denote by JL/. the left-hand intersection of (. -, with L and by 
v\ the right-hand intersection, with the opposite conventions for // and v" , 
the intersections of C [ with L. The points jui , , M &gt; Mi&gt; . /"" ma y 
be obtained from the points a{, , a n , , a", , a n by a similarity trans 
formation with P as center, and as a line L is allowed to rotate about P its 
intersections with C\ and C have always this same property. In fact, we 
may consider properly chosen intersections of L with C f and C" to be the roots 
of /j and /o ; then the roots of &lt;/ trace the circles C* . In particular, when L 
is tangent to the circles C( , , C n , , C [, , C" , the points of tangency 
have this relation to the points of tangency with C\ , , C n 

To prove Theorem VI, we consider as usual the field of force given by 
Theorem I (I, p. 291 == I, p. 101). We can obtain immediately a qualitative 
idea of the locus of the roots of J . No point Q above both of the tangents 
T and T common to the circles (\, ,C, C\, , ,, C (, , C" n ,, 
can be a root of J . For the force at Q due to the particles situated at the 
roots of /i has a component normal to PQ and that due to the particles situated 
at the roots of / 2 has a component normal to PQ and in the same sense; at 
least one of these components is different from zero. Thus also no point 
below both T and T can be a root of .7 and no point above T (or T ) but not 
above T (or T) yet lying to the right of C [ and to the left of C{ can be a 
root of J . Xo point Q of T (or T ) can be a root of J unless all the roots of fi 
and of / 2 are on T (or T ) which can only occur if these roots lie at the points 
of tangency of T (or T ) with the circles bounding their proper loci; that is, 
if Q lies at a point of tangency of T ( or T ) and a d . Inasmuch as the locus 
of the roots of J is a closed point set there must be some sort of a boundary 
of that locus between any two of the circles d . 

By means of the similarity transformation with P as center, we see that 
every point of the locus as stated in Theorem VI is really a point of the locus. 
To complete the determination of the locus we have merely to prove that if 
a point Q is a point of the boundary of the locus, that point is on one of the 
circles d. 

The interior and boundary of the circle (? or C [ which is the locus of more 
than one root of /i or of / 2 is also the interior and boundary of one of the 
circles d; every point interior to or on such a circle is a point of the locus of 
the roots of J. A point on or interior to two circles C\ and C" 1+1 (and so of 
course C? and C - +l ) is also on or within a circle Cj- in fact there is a root 
a.j of J between &lt; and a. i+l , and the circle Cj whose center is a,- contains in 
its interior the region common to (? and C i+l , for this is true of any circle 
whose center lies between a\ and a. i+l if the circle has the two common tangents 
T and T . 
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It merely remains to consider points Q exterior to all the circles C[, , 
C n &gt;, C i, , C n" or interior to or on at most one circle C\ or C - which is 
the locus of but one root of f l or /, . In any case the particles at the roots 
of /i and f- 2 may by Lemma I (II, p. 102) be considered to coincide in their 
respective loci so far as the force at Q is concerned. 

9. Proof of Theorem VI; replacing of two particles by a single particle. 
The point Q is then to be considered as fixed, and for definiteness to lie to the 
right of P and of course above one of the lines T and T but below the other, 
so that the line PQ actually cuts all the circles CI, , C n , , C [, , C. 
We know the loci of certain particles each representing several of the roots 
of fi , we shall replace two of these particles by a single equivalent particle, 
then the new particle and a third of the original particles by a single equivalent 
particle, and so on until we have replaced all the particles representing the 
roots of fi by a single equivalent particle, and similarly for the particles 
representing the roots of / 2 . A study of the properties of the loci of these 
various particles will enable us to prove that Q is on one of the circles Ci. 
We suppose for the present that Q is exterior to all the circles C . . 

To replace two particles at fi and f 2 of masses mi and m* by a single particle 
f of mass mi + ? 2 so that the force at Q shall be unchanged, we have the 
equation for &lt;T 

mi , ?- mi + m 



f i - g h- q r - q 

where q is the complex number representing the point Q. This equation is 
equivalent to 

(fi- g)(ft " D = mi 
(f-?)(f-fi) 2 

We wish to replace the particles f i and fo , whose loci are the interiors and 
boundaries of C( and C 2 and which represent all the roots of /i having these 
regions as loci, by a single equivalent particle. If we transform Q to infinity, 
we shall have precisely the conditions of III, Theorem VIII; mi and mi are 
both positive and the points f i and f 2 are always separated by f and Q . 
Then the variable circle C of Lemma IV (II, p. 105) moves so as always to 
cut C( and C 2 at the same angle, and cuts also S( , the circle bounding the locus 
of f , also at this same angle. When Q is transformed back to the finite part 
of the plane, it remains true that C cuts Ci and C 2 at the same angle; C cuts S{ 
at this same angle or the supplementary angle according as the locus S( is 
interior or exterior to its bounding circle.* We leave this fact to be verified 
by the reader; this can be done by considering any one circle C under the 

* This difference in behavior, which we shall constantly meet, disappears entirely if we 
project stereographically on to the sphere. 
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transformation of Q from the point at infinity to its original finite position. 
In particular it will be noticed that the line PQ is one of the circles C cutting 
C{, C 2 , S{ at the proper angles. When fi and " 2 are on PQ and are chosen 
as the right-hand (left-hand) intersections of PQ with C( and C 2 , f is on PQ 
and at the right-hand or left-hand (left-hand or right-hand) intersection of PQ 
with S{ according as the locus S{ is interior or exterior to its bounding circle. 
The converse of this statement is also true; such a choice of f leads to a unique 
determination of fx and f 2 as described. 

\\V have supposed Q to be exterior to all the circles C\ ; suppose now Q 
exterior to C{ but interior to C 2 ; we need not consider Q interior to the two 
circles. When Q is transformed to infinity we have a special case of Theorem 
I, but no longer a special case of III, Theorem VIII. The circle C which 
generates as in Lemma IV (II, p. 10.")) the boundary of S , cuts C( and f . at 
supplementary angles. In fact, if we assume C to cut ( { and C 2 at equal 
angles, but not at supplementary angles, when jTi , fT: T are on the boundaries 
of their loci, the line fi f 2 f can be rotated about f so that ~i and f 2 move into 
the interiors of their loci, so f" cannot be on the boundary of its locus. The 
circle Si is found to be cut by C at an angle equal to that cut on C 2 and 
supplementary to that cut on C\ . 

We shall not consider in detail the case that Q is on C( or C 2 ; we need not 
consider Q on both circles. Whether Q, is on or within one circle or exterior 
to all the circles C it is always true that when Q is in its original finite position 
C cuts C( and C 2 at the same angle, and cuts S( at this same angle or the 
supplementary angle according as the locus S{ is interior or exterior to its 
bounding circle. When f t and fo are chosen properly as the intersections of 
f i and C - 2 with PQ , one of the circles C , f is on PQ and on Si , and conversely. 
The tangents to these three circles at those three points are parallel. 

We have thus replaced the particles at f j and " 2 by a single equivalent par 
ticle. So far as the force at Q is concerned, we can replace f i and f 2 at any 
positions in their loci by f in its locus S i , and for any position of f in Si we can 
determine f \ and f 2 in their loci so that the force at Q is the same. If Q is in 
C( or C 2 , the force at Q can be made as large as desired, so Q must be in Si , 
and conversely. If Q is on C( or C 2 , the force at Q can be made as large as 
desired but only in certain special directions, so Q is on Si , and conversely. 
If the region Si is external to its bounding circle, the force at Q is zero for proper 
choice of f and hence of f x and f 2 , and conversely. 

Next we replace by a single equivalent particle f the particle f as just deter 
mined and f 3 , the particle which represents the roots of /i whose common 
locus is C 3 (assuming the existence of this set of roots). No further detailed 
discussion is required of this new situation; as before, a system of circles C 
cuts Si , C 3&gt; and the boundary S 2 of the locus of f at equal angles or at angles 
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supplementary to the angle cut on C 3 according as the loci S( and S 2 do not or 
do include the point at infinity. The line PQ is as before one of the system of 
circles C . When f is on PQ and on 182 , Ts is on PQ and on C 3 and f is on 
PQ and S( , so f t and f 2 are on PQ and C( and ^respectively; moreover, the 
tangents to C{ , C 2 , C 3 at f i , f 2 , sTs , respectively, are parallel. The converse 
of this statement is also true. 

We continue in this same manner to replace pairs of particles by equivalent 
particles, and finally replace all the particles f i , f 2 , , iV representing the 
roots of fi by a single particle 771 whose locus is a circular region Si whose 
boundary is cut by PQ at an angle supplementary or equal to the angle cut 
on C{, C 2 , , C n according as the locus contains or does not contain the 
point at infinity. When 771 is on PQ and on Si , we know that f i , , n &gt; 
are on C( , , C n &gt; respectively and that the tangents to these circles at these 
points are parallel. It follows from reasoning to be given later that at no 
intermediate stage is the locus of one of our auxiliary particles the entire 
plane. 

Similarly the particles 1, 2, " , kn" representing the roots of /&gt; are 
replaced by a single particle 770 whose locus is a circular region S 2 which is 
either one of the points P or P or is bounded by a circle S 2 which is cut by 
PQ at an angle equal to the angles cut on the circles C", C" 2 , , C" n . In 
fact, if all the roots of / 2 are not concentrated at P the particles corresponding 
to the roots of / 2 always exert at Q a force not zero, so the locus of r? 2 does 
not include the point at infinity. When 772 is on PQ and S 2 , w T e know that 
1, &&amp;gt; j SB" are on ^ i. ^2. " &gt; C n" respectively, and that the tangents 
to these circles at these points are parallel. 

10. Theorem VI: proof completed. For Q to be a root of J , the loci Si 
and S 2 must have at least one point in common, at which point are to coincide 
rji and 772 so that their resultant force at Q shall be zero. Such a common 
point cannot be Q, for Q is not a point of S 2 . The loci Si and S 2 cannot 
overlap if Q is on the boundary of its locus, for when Q varies slightly in any 
direction, Si and S 2 vary but slightly. If Si and S 2 overlap, we may vary 
Q slightly in any direction but so little that Si and S 2 still have common 
points, so that Q remains a root of J for some choice of 771 and 77.2 and hence 
is not on the boundary of the locus of the roots of J . We defer until later 
the possibilities that the two circles Si and S 2 coincide or that Si or S 2 may 
be the entire plane or a point. 

The regions Si and S 2 have but a single point in common, and since PQ 
cuts the circles Si and S 2 at equal or supplementary angles according as Si 
does or does not contain the point at infinity, a single point common to these 
two loci must lie on PQ . That is, 771 and 772 lie on Si and S 2 respectively, and 
on PQ , so f i , f 2 , , f , si , s2 , , 4V all lie on PQ and the lines tangent 

Trans. Am. Math. Soc. 4. 
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to the circles f ", , f" 2 , , (",, ("(, (, . . , (* at these points are parallel. 
Then Q lies on one of the circles d .* 

The possibility that the circles Si and S 2 coincide is readily treated. We 
may choose 771 and 772 to coincide on Si and on S 2 , and on PQ . These points 
are still on the boundaries of their respective loci, and hence the previous 
reasoning is valid. 

According to the assumptions already made, the locus Si cannot be a point. 
The locus S 2 will be a point when and only when the roots of / 2 are concen 
trated at P or P or both. But in such a case the single point S 2 is on the 
line PQ and the preceding reasoning holds. 

The possibility that Si or S 2 may be the entire plane remains to be con 
sidered. If one of these loci is the entire plane, the other must be a point; 
otherwise we have essentially the case of overlapping already disposed of. 
The locus S 2 is either the point P or does not contain P , so is never the 
entire plane. If Si is the entire plane, we may suppose S 2 to be a point 
which of course lies on PQ . We prove our former result by a limiting procc-^. 
When a point Q is very near Q but external to the locus of the roots of J , 
the circles S(, S 2 , , Si are very near the corresponding circles for Q; for 
Q the locus Si is certainly not the entire plane. Denote by Z 2 the point at 
which is located the single particle representing all the roots of / 2 , so far as 
concerns the force at Q ; 2 2 is not in the locus Si. When Q approaches Q 
always remaining exterior to the locus of the roots of J , 2 2 approaches the 
point S 2 . The circle Si corresponding to Q becomes smaller and smaller, 
the locus Si never contains 2 2 , so the circle Si approaches the point S 2 . 
We may choose rj] an intersection with PQ of the circle Si corresponding to 
Q , and we shall have the points fi , , &lt;;&gt; on PQ and on the circles C{ , 
, C n &gt; . When Q approaches Q , PQ approaches PQ , the point 771 ap 
proaches S 2 and the points ft, , n &gt; approach points on PQ and on C{ , 

, C n . These limiting points can be taken as corresponding to 771 coinciding 
with S 2 and thus give our result that Q lies on one of the circles C\, , C n . 

Theorem VI is now completely proved except for its last sentence. When 
we notice the number of roots of J in a region d and remark that if the roots 
of /i and/ 2 are varied continuously then the roots of J vary continuously, and 
that if d is exterior to every other circle Cj no root can enter or leave d, 
this last sentence is seen to be true. It hardly need be added that a number of 
circles C t - which may have common points but which have no point in common 
with any other circle Cj contain a number of roots of J equal to the sum of the 
multiplicities W; corresponding to their centers as roots of J . 

* The mere fact that the tangents at these points are parallel does not rule out certain 
isolated points Q on the line Pa( a n , d[ a",, but a more detailed consideration of 
the loci Si, 82, " does rule them out without difficulty. 
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11. Generalization of Theorem VI by transformation. In Theorem VI we 
have assumed that P is interior to none of the circles C( , , C n &gt; , C [, , 
C n" . The theorem is still true if this assumption is omitted, even if we permit 
roots of one or both forms to lie at infinity, except that the locus of the roots 
of J may be the entire plane, and will surely be the entire plane if P is interior 
to or on the circles C( , , C n &gt; , C", , , and if no roots of either ground 
form are constrained to lie at P . The proof as given requires only a few minor 
modifications to apply to this new configuration. If the circles C\ , , 
C n , C", , C n" all have the common center P , the circles bounding the 
locus of the roots of J are not determined by the position of their centers as 
described in the statement of Theorem VI, but are to be determined for 
example by their points of intersection with an arbitrary line through P, 
precisely as we considered the points ^ and n in S. 

Theorem VI has the advantage over Theorem II of being entirely sym 
metrical with respect to the two forms f\ and / 2 . The special case of Theorem 
VI where there are two circles C[ , C 2 and merely one circle C [ leads to merely 
one circle C\ distinct from the three original circles. For this case, Theorems 
II and VI give the same result. But of course Theorem II is more general 
than this particular situation. Thus, the result for the jacobian problem of 
the theorem stated in II, pp. 114-115, or indeed of the problem of 9 where 
Q is interior to C 2 is included in Theorem II but not in Theorem VI. There is, 
however, a general theorem concerned with an indefinite number of circular 
regions C( , , C n &gt; , C", , C n which generalizes all possible situations 
of Theorem II and which is to be proved in 13-15 by the methods we have 
been using. 

Theorem VI as stated is not invariant under linear transformation. If 
we perform such a transformation we obtain the following new result: 

THEOREM VII. // the loci of the roots of fi are circular regions bounded by 
circles each of ivhich is tangent internally to a circle L\ and externally to a circle 
LI (tangent to both LI and L internally) and if the loci of the roots off 2 are circular 
regions bounded by circles each of which is tangent externally to LI and internally 
to L 2 (tangent to both LI and L 2 externally), and if these loci are so related to their 
bounding circles that they contain neither the entire circle LI nor the entire circle 
LZ , then the locus of the roots of the jacobian J offi andfz is a number of circular 
regions bounded by circles each tangent internally to LI and externally to L z or 
tangent externally to LI and internally to L* (tangent to LI and L 2 internally or to 
LI and L 2 externally) and such that each region is so related to its bounding 
circle that it contains neither the entire circle LI nor the entire circle L* . The 
exact location of the circles bounding the locus of the roots of J can be determined 
by allowing the roots of the ground forms to coincide on LI or on L 2 , always 
remaining in their proper loci; the roots of J are the points of tangency with LI 
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and LI of the circles desired. The number of roots of J in any of these circular 
regions having no point in common with any other of these regions is the multiplicity 
as a root of J of the points of tangency of its boundary with LI and L z under these 
conditions. 

There is a limiting case of this theorem yet different from Theorem VI 
where LI is a straight line and L 2 a proper circle, but we shall not state the 
result in detail. 

12. Application of Theorem VI to the roots of the derivatives of polynomials. 
Theorem VI and in fact Theorem VII can be used to obtain results concerning 
the roots of the derivative of a rational function by means of the remark made 
in I, p. 297 (or II, p. 114). We thus consider C{ , , C n &gt; as the loci of the 
roots and C [, , C" n as the loci of the poles of the given function. The 
regions C\ , , C n together with the possibility of the point at infinity 
appear as the loci of the roots of the derivative. There is a peculiar difference, 
however, betw r een this locus and the corresponding locus of the roots of the 
jacobian. A region C" which is the locus of more than one pole of the original 
function is the locus of at least one root of the derivative, with the exception 
that no point of the bounding circle C" can be a root of the derivative unless 
it is interior to another region C"; we leave to the reader the verification of 
this statement. 

We shall dwell at some length on an application of the above remark applied 
to Theorem VI, which is indeed a special case of that theorem, concerning the 
derivative of a polynomial: 

THEOREM VIII. Let the interiors and boundaries of circles C\, , C n 
whose centers are a\, , a n be the loci of nil, , m n roots respectively of a 
polynomial f (z) ivhich has no other roots; suppose these circles to have a common 
external center of similitude P actually exterior to all these circles. Denote by 
g(z} the polynomial f (z) when all its roots are concentrated at the centers of their 
proper circles, and denote by a( , , a n &gt; the distinct roots of its derivative g ( z ) , 
of respective multiplicities m(, , m n &gt; . Then the locus of the roots of f (z) is 
composed of the interiors and boundaries of the circles C{, , C n &gt; whose centers 
are a{, , a n &gt; and whose radii are such that P is a common external center 
of similitude for the circles C\, , C n , C{, , &gt; . A circle C , which has 
no point in common with another circle C j contains m( roots of f (z) . 

An extreme degenerate case of this theorem is when all the C; are null circles, 
and / ( 2 ) is identical with g ( z ) . The case of merely two circles brings us 
back to a theorem given in II, p. 115. 

Inasmuch as the circles C i have P as a common external center of similitude, 
Theorem VIII can be applied again to the polynomial / (s) and shows that 
the roots of/" (z) lie on or within certain circles C [, , C". The most 
obvious consideration of the geometric situation shows that any point on or 
w r ithin one of these circles actually is a point of the locus. 
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It is to be noticed, however, that this reasoning can be used only if we 
know that the circles C\ contain respectively m\ roots of f(z), and it is 
pointed out in 4 that one of these circles does not necessarily contain pre 
cisely that number of roots of f (z) and in fact may contain no such root. 
Thus the reasoning can be used only if we know that the circles (7- are mutually 
external. This is always the case for a given set of values of i , , o^if the 
circles d are sufficiently small, so in the following theorem we require that 
the circles d be sufficiently small. This means, more explicitly, that the 
theorem is true for a definite derivative g$ if the circles (7; are so small that no 
circle C*" has a point in common with a circle C ( j" } (i ^ j), for m = 1, 2, 
, k - 1. 
Thus we have 

THEOREM IX. Let the interiors and boundaries of circles C\, , C n whose 
centers are a\, , a n be the loci of m\ , , m n roots respectively of a polynomial 
/(z) which has no other roots; suppose these circles to haw a common external 
center of similitude P actually exterior to all these circles. Denote by g(z) the 
polynomial f ( z ) when all its roots are concentrated at the centers of their proper 
circles, and denote by a * } , , a% the distinct roots of its kth derivative g w ( z ) , 
of respective multiplicities m^ , , m ( fh . Then if the c rcles C i are sufficiently 
small the locus of the roots of / (t) (z) is composed of the interiors and boundaries 
of the circles C"/ 1 , , C ( ^ whose centers are ol^ , , a ( ,*l) , and whose radii 
are such that P is a common external center of similitude for the circles C\, , 
C n , C ( i } , , (7% . A circle C ( V ivhich has a point in common with no other 
circle C ( j } contains precisely m ( ? } roots of / w ( 2 ) . 

The special case of this theorem where there are but two of the original 
circles Ci and C 2 has already been proved by another method.* For this 
special case we make no restriction on the size of the circles d . 

A limiting case of Theorem IX is that P is infinite but the points a\ , , 
a n finite, and the radii of Ci , , C n finite. The circles Ci, -, are then 
all equal. The theorem is true for this limiting case. In fact, suppose a root 
R of f w ( z ) to be exterior to all the circles C ( f&gt; , , C% . We can choose 
circles S\ , , S n having a finite point P as common external center of 
similitude and such that R is also exterior to all the circles S ^ , , S^il, 
corresponding. This shows that every point of the locus is on or within 
Ci c) , , C",*!); the converse is easily seen from translation of the situation 
for g(z) and &lt;/ (fc) (2) . This result may be expressed somewhat loosely as 
follows : 

THEOREM X. // the loci of the roots of a polynomial are the interiors and 
boundaries of sufficiently small equal circles whose centers lie on a line L, the 
locus of the roots of the kth derivative / (i) ( 2 ) consists of the interiors and bound- 
* Walsh, Paris Comptes Rendus, vol. 172 (1921), pp. 662-664. 
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aries of circles equal to these whose centers also lie on L and depend only on the 
centers of the original circles. 

This new theorem for k = 1 is not a special case of Theorem VI and can 
easily be expressed in a form invariant under linear transformation, thus giving 
a new result for the jacobian of two binary forms (compare 14) and for the 
derivative of a rational function.* 

The approximate determination of the roots of the jacobian of two binary 
forms, of the derivative of a rational function, or of any derivative of a poly 
nomial is thus made, by Theorems VI-X, to depend essentially on the deter 
mination of the roots of the jacobian, of the derivative of a rational function, 
or of any derivative of a polynomial which has all its roots real. 

The extreme simplicity of Theorem X immediately raises the question of the 
truth of that theorem if the supposition of the collinearity of aj , , a n 
is omitted. We can easily prove that the theorem is not true under this 
changed hypothesis by means of the remark of (&gt;. It is surely true under 
the changed hypothesis that every point on or within a circle C\ that is equal 
to the Ci and whose center is a: is a point of the locus, but it is not true without 
further restrictions that the locus consists precisely of the points on and 
interior to the circles ( , . 

Consider a polynomial g(z) with three simple roots on, a-2, ots, which are 
not collinear, so that neither root a( , a 2 of g (z) is collinear with a pair of the 
points cti, 2 , 3 Choose the equal circles Ci, C\, C 3 , with centers a\, a*, 
as, of such small radius that for no possible choice of the points in their 
proper loci can we have two roots /3; and /3, of /(z) collinear with a root @ k of 
/ (z). Suppose (Si to be on f i; we choose /3,- of such a nature that 
&i ~ di = jS i a l . The circle C through /3i , j3 2 , 0i is not a straight line, the 
points j3i and /3 2 cannot satisfy the requirements of 6 with regard to the circles 
C , Ci , and C\ , from which, it follows that /3J cannot be on the boundary of its 
locus. 

13. Theorem XI: an extension of Theorems II and VI. We now come to 
the proof of the general theorem mentioned in 11, which includes Theorem 
II as well as Theorem VI : 

THEOREM XI. Let fi and / 2 be two binary forms, and let circular regions 
C i&gt; C 2 , , C n &gt;; C", C", , C n" be the respective loci of m( , m 2 , , m n &gt; 
roots of fi (which has no other roots) and of m [, m" 2 , , m n roots of / 2 (which 
has no other roots). Suppose there is a family of coaxial circles S each of which 
cuts at the same angle all the circles C ichich bound loci interior to them, and at 

* (Added in proof) : It seems to the writer probable that Theorem IX is true with no re 
striction on the size of the circles Ci . The special case of this more general theorem where 
the circular regions d are half planes is for the case k = 1 contained by a limiting process in 
Theorem IX, and for all values of k has been established by Mr. B. Z. Linfield, in a paper to 
be published in these Transactions. 
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the supplementary angle all the circles C- which bound loci exterior to them, and 
which cuts at this same angle all the circles C" which bound loci exterior to them 
and at the supplementary angle all circles C" which bound loci interior to them. 
Then the locus of the roots of the jacobian of fi and fa is a number of circular 
regions bounded by circles Ci , C 2 , , C n each of which is cut by every circle S 
at an angle equal or supplementary to the angles cut by S on C\ and C"; the regions 
which are the loci of the roots of the jacobian may be either internal or external to 
their bounding circles. The circles C\ , C 2 , , C n are included among the circles 
traced by the roots of the jacobian when all the roots of fi and fi are concentrated 
o)i the circles bounding their proper loci and move so that one of the circles S con 
stantly passes through them all, while the lines tangent to these bounding circles 
C\ , C" at the points which are the roots of fi and / 2 (and the lines tangent to Ck 
at the points which are the roots of the jacobian) all become parallel ivhen S is 
transformed into a straight line. Any region Ci having no point in comnmn n-ith 
any other region fy contains it number of roots of the jacobian equal to the multi 
plicity of the root of the jacobian trhich traces that circle Ci under these conditions. 

We shall first undertake to prove this theorem for the simplest case, namely, 
that the circles *S form a coaxial family having no point common to all those 
circles. We transform so that the circles S have a common center P . If the 
given circles C\ , C" are all straight lines, all the regions C( , C" have a common 
point, the locus of the roots of the jacobian is the entire plane, and the theorem 
is proved. In any other case, all the circles C(, , C n &gt; , C", , C" n are 
equal ; the loci corresponding to the former can be considered to lie inside the 
bounding circles and those corresponding to the latter to lie outside the bound 
ing circles. 

We shall use the same method of proof as was used in 9, namely, the 
replacing of all the particles in the field of force corresponding to the roots of 
/i by a single particle T\\ , the replacing of all the particles corresponding to 
the roots of / 2 by a single particle rj 2 , and the study of the loci of 771 and 772 . 
We shall prove that no matter what may be the location of the circular regions 
which are the loci or the distribution of the roots of the ground forms among 
these loci, the locus of the roots of the jacobian is always the entire plane. 

If a point z is exterior to all the circles C\ , C" and exterior to all the circles 
S which actually cut those circles C\ , C" , then z is a root of the jacobian. 
For if n" &gt; 1 , z lies in C [ and C" 2 , may lie at a multiple root of / 2 , and hence 
is a point of the locus. If n" = 1 , replace the particles fr and f 2 of masses 
m i and m 2 whose loci are C( and C 2 by a single equivalent particle f of mass 
m( + m 2 whose locus is a circular region Si . Then the circle S( is larger than 
C[; this follows from the fact that there are two circles through 2 tangent to 
S{ , C{, C 2 , and having the same kind of contact with all three of these circles; 
neither intersection of those two tangent circles with each other separates 
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any two of the points of tangency of the circles S( , C( , C 2 . The locus of f is 
the interior and boundary of S{. If n &gt; 2 , we now replace f and the particle 
S" 3 whose mass is m 3 and whose locus is C 3 by an equivalent particle f of mass 
?i + ?H 2 + ?3- The locus S 2 of f is the interior of a circle which does not 
contain z and which is larger than C 3 , by the reasoning just used. We con 
tinue in this way and finally replace all the particles representing the roots 
of /i by a single equivalent particle rji whose locus Si is larger than the circles 
C i (or if n" - 1 , equal to them). In any case, the region SL has at least one 
point in common with the region C [, so z is a point of the locus. 

Denote by 2i the larger and by S 2 the smaller of the two circles of the 
family S which are tangent to C and C" . If z is interior to S 2 , and if S 2 is 
exterior to C , C" , the reasoning just given applies with practically no change. 
If n" -- 1 , the locus of 171 which represents all the roots of /i is a region S\ 
whose bounding circle is larger than the circles C i , so the region Si must have 
at least one point in common with the region C i, and z is a point of the locus. 

Let us now consider a point 2 between Si and S 2 under the assumption that 
S 2 is not interior to the circles C i , C" . If //" = 1 , we find as before circles 
S[ , S z , , S], all larger than C{ . In fact, we need consider only points z 
interior to or on at most one circle C . Describe a circle S through z and 
through the points of tangency of Si with C{ and Co . When "i and " 2 lie at 
these two points, the point f corresponding lies on 2 , and is such that z and f 
separate fi and f 2 . Hence f is exterior to Si. Similarly there is a point f 
interior to S 2 , so S[ is indeed larger than C( . Thus we find z to be a point 
of the locus, for Si and C i have a common point. 

If n" &gt; 1 , we need consider only points z interior to at most one circle C\ 
and exterior to at most one circle C" . The circles S{, S 2 , , Si are all 
larger than C[ (or equal to C( if n = 1) . The region S" which is the locus 
of , the particle equivalent to the particles 1 and 2 whose loci are C i and Cl , 
is a circular region which contains all the region common to C" and C" . Hence 
the circle S i is smaller than C [. The region Si which is the locus of the 
particle r?i representing all the roots of /i is larger than each circular region 
C i . The region S 2 which is the locus of the particle rj 2 representing all the 
roots of / 2 is bounded by a circle smaller than the bounding circle of each region 
C" , so Si and 82 have at least one common point and z is a point of the locus. 

It remains to consider points z interior to S 2 , if S 2 is interior to C and C" , 
but this treatment is so similar to the results already given that it is omitted. 
It remains also to consider points t on Si and on S 2 , but since all other points 
of the plane are points of the locus and the locus of the roots of the jacobian 
is a closed point set, these points also belong to the locus. Theorem XI is 
now completely proved if the circles S have no point in common. 

14. Theorem XI, proof continued. We next undertake to prove Theorem 
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XI for the case that the circles S form a coaxial family of circles all tangent 
at a single point P, which point we transform to infinity. If none of the 
original circular regions C\, , C n &gt; , C [, , C" n &gt;, is the point at infinity, 
our results just proved for the case of circles S having no point in common 
hold without essential change; the entire plane is the locus of the roots of the 
jacobian. But P may be considered a null circle, the locus of a number of 
roots of /i and of / 2 ; in this case the entire plane need not be the locus of the 
roots of the jacobian. 

If all the roots of / 2 are concentrated at P , then either all the roots of /i 
are also concentrated at P and the locus of the roots of the jacobian is the 
whole plane, or there are a number of fixed equal circles C( , , C n &gt; bounding 
loci interior to them. In this latter case the field of force is precisely the field 
corresponding to Theorem X, so for this case Theorem XI is already proved. 
The case that there is at least one finite circle C" requires some further 
consideration. 

Denote by 2i and S 2 the lines which belong to the coaxial family S and which 
are tangent to all the circles C{, , C" n and transform so that Si and S 2 
are horizontal, with Si above S 2 . Any point z on the boundary of the locus 
of the roots of the jacobian must lie on one of the circles traced by the roots 
of the jacobian when the roots of the ground forms trace the boundaries of 
their respective loci all constantly lying on one variable circle S and tracing 
the circles C(, , C"" in the same sense; this is the location of the roots 
of the ground forms described in the statement of Theorem XL This fact is 
proved precisely as in 9, 10, if z lies on a circle S which actually cuts the 
circles C{, , C" n " We replace the particles at the roots of /i by a single 
equivalent particle T?I , the particles at the roots of / 2 by a single equivalent 
particle ?j 2 , and notice that when z is on the boundary of its locus the loci of 
T;I and ?? 2 cannot overlap. To complete the proof of Theorem XI in our 
special case it is sufficient to consider points z say above Si and to prove that 
all such points are points of the locus of the roots of the jacobian. 

If z is a point above Si and if there are two or more finite circles C , z is 
common to two or more of those circular regions and is therefore a point of the 
locus. If there is but one circle C" other than at infinity, further consideration 
is required. 

The locus of ?7 2 is the exterior of a circle S 2 obtained from C [ by similarity 
transformation with z as center, and Sz is farther from z than is C [. Thus 
if there is but one finite circle C( , the locus of 771 is the interior of a circle S[ 
obtained from ( by similarity transformation with z as center, and the loci 
S{ and S 2 must have at least one common point, so 2 belongs to the locus. 

If there are two finite circles C\ , we replace the particles whose loci are C( 
and C 2 by a single equivalent particle whose locus is S( , and then replace that 
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particle and the particles at infinity which represent the roots of /i by a single 
equivalent particle whose locus is Si . There are two circles through z tangent 
to Si, C(, C 2 ; one of these circles contains Si, C{, C 2 and the other contains 
none of these circles. The external tangents to S( and C{ intersect on the 
horizontal line through z, the radical axis of the circles through 2 tangent to 
Si , C i, C 2 .* Hence there is a circle S tangent to Si and S 2 and such that 
S and Si have z as common external center of similitude. It follows that the 
regions Si and So have at least one common point. In fact, if there is no line 
through z which cuts both C { and Si , Si is entirely interior to S 2 . If there is 
a line through z which cuts both C [ and Si , a line through z and tangent to 
C i cuts Si and Si and lies wholly in S 2 . Thus 2 is a point of the locus. 

If there are three finite circles C\, we find Si as before; the external center 
of similitude of Si and C 3 lies on the horizontal line through z, so that line is 
the radical axis of the two circles through z and tangent to Si, S 2 , C 3 ; one of 
these tangent circles contains Si , S 2 , C 3 , and the other contains none of these 
three circles. Then the external center of similitude of S 2 and C 3 lies on the 
horizontal line through z, and as before we find that Si and So have at least 
one common point. This reasoning is general for any number of circles C i . 

Every point z above Si and hence every point below S 2 is a point of the locus. 
We may show either by similar reasoning or as in 13 that every point on 
either of these lines belongs to the locus. 

Theorem XI is thus proved for circles S all tangent at a single point. It is 
worth while, perhaps, to point out explicitly that there actually exist situations 
with one or more circles C , and where the locus of the roots of the jacobian 
is not the entire plane. Thus, let there be merely two finite circles CJ and let 
2 be interior to both of them. Then Si is a region exterior to a circle which 
surrounds z . If 2 is exterior to all the circles C { and if the locus Si is interior 
to its bounding circle, it is possible so to choose the number of roots of / 2 at 
infinity that S 2 shall be the region exterior to a circle which entirely contains Si . 
Then Si and S 2 have no common point and 2 is not a point of the locus of the 
roots of the jacobian. 

15. Theorem XI; completion of the proof. The case that the circles S of 
Theorem XI form a coaxial family of circles through two distinct points P 
and P remains to be dealt with. Transform P to infinity. The points P 
and P are considered as null circles and hence allowed to be loci of a number 
of roots of /i or/ 2 or both. As in 8 we may assume that there is at least one 
circle C or C" distinct from P and P . 

If the circles C{ , , C" n " surround P , Theorem XI can be proved precisely 
as in 8-10. If C i, , Q" n do not surround P , these same methods show 
that no point z is on the boundary of its locus unless 2 is on one of the circles 

* Coolidge, A Treatise on the Circle and the Sphere, p. Ill, Theorem 217. 
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described in the theorem, provided that there is a circle S through z which 
actually cuts all the circles C(, , . If all the finite regions C(, , 
C"" are interior to their bounding circles, the theorem is Theorem VI and com 
pletely proved. If two or more of these regions are exterior to their bounding 
circles, every point z not on a circle S which cuts all the circles C{, , C" n 
is a possible position of pseudo-equilibrium and hence a point of the locus. 
It remains to consider the case of such points z with merely one finite region, 
say C( , exterior to its bounding circle. Let the line of centers of the circles 
C i, , C n" be horizontal and denote by 2i and S 2 the common tangents to 
CI , , C"n" . Let Ci lie to the left of P . We shall phrase the proof for 
ri &gt; I , n" &gt; 1 . 

The particles fi and f 2 wiiose loci are C( and C 2 are to be replaced by a 
particle " whose locus is a circular region Si . There are two circles through z 
tangent to C{ , C 2 , 8{ , one of which includes C( but not C 2 , the other of which 
includes C 2 but not C{ . If the locus Si is not the entire plane, it follows from 
a simple consideration of points "1 &gt; fo , T n the circle through z orthogonal to 
C i and C 2 that these two tangent circles include Si and exclude Si respectively. 
If the locus S( is the entire plane, the loci S 2 , S 3 , , Si are all the entire plane 
and z is a point of the locus. 

These two tangent circles intersect on the line Pz,* and the circle Si lies to 
the left of Pz . The external center of similitude of C{ and Si and the internal 
center of similitude of C 2 and Si lie on Pz . It is thus true that the external 
center of similitude of Si and any circle lies on Pz and that the internal 
center of similitude of Si and any circle C\ other than C( lies on Pz. 

We now replace f and fo , the particle whose locus is C 3 , by a single 
equivalent particle " whose locus is a circular region S 2 . If the locus S 2 is 
not the entire plane, there are two circles through z which intersect on Pz and 
which are tangent to Si , C s , S 2 , one of these tangent circles contains Si and 
S 2 but does not contain C 3 , the other contains C 3 but neither Si nor S 2 . We 
continue in this way and finally reach a circle Si which bounds the locus of 
the point T?I which represents all the roots of /i; the locus of 771 is exterior to Si . 
The external center of similitude of Si and any of the finite circles (and 
also of C( ) lies on Pz , and the internal center of similitude of Si and any of 
the finite circles C except C\ lies on Pz . 

Similarly the particles representing the roots of / 2 are replaced by a single 
equivalent particle j? 2 whose locus is the interior of a circle 82 such that the 
external center of similitude of S 2 and any of the finite circles C 1 lies on Pz. 

Hence the external center of similitude of Si and S 2 lies on Pz , from which 
it follows as before that there is at least one point common to Si and S 2 , so z 
is a point of the locus. Likewise all points of Si and S 2 are points of the locus. 

* Coolidge, loc. cit. 
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As in 14, cases actually arise here where all the regions C{ , , ( "" 
are not within their finite bounding circles and yet the locus of z is not the 
entire plane; the proof is as in 14. 

The number of roots of the jacobian in a region Ci which has no point in 
common with any other region d which is a part of the locus of the roots of 
the jacobian can be determined as in 10 for Theorem VI; the proof of 
Theorem XI is now complete. 

The determination in Theorem XI of whether or not a given circle d is 
actually a part of the boundary of the locus of the roots of the jacobian, and 
if so whether the circular region corresponding lies interior or exterior to d, 
can be made in any given case by the methods developed in the present chapter. 

Theorem XI has obvious applications which will easily be made by the reader 
to the study of the location of the roots of the derivative of a polynomial and 
of the derivative of a rational function. 

CHAPTER III: ON CENTERS OF GRAVITY 

16. The loci of certain centers of gravity. There is a striking analogy be 
tween some of our results concerning the location of the roots of the jacobian 
of two binary forms and results which are easily proved concerning the location 
of the center of gravity of a number of particles. Thus, the fact that if a 
number of positive particles lie in a circle their center of gravity also lies in 
that circle is analogous to Lemma I (II, p. 102) and was used in the proof 
of that lemma, and is also analogous to the theorem of Lucas. From this fact 
and Theorem VIII of III we prove the analogue of a theorem given in II, p. 115 
( = Theorem I of S) precisely as that theorem was proved : 

THEOREM XII. // the interiors and boundaries of two circles C\ and C 2 of 
centers a\ and a z and radii TI and r 2 are the loci respectively of mi and ?&gt; unit 
positive particles, then the locus of the center of gravity of these particles is the 
interior and boundary of the circle C whose center is 

mi ai 4- m-i a 2 



in\ + m 2 
and whose radius is 

mi ri + m 2 r 2 
m\ + ni 2 

The three circles d,C 2 ,C have as common external center of similitude the point 

TI a 2 r z ai 



Theorem XII can be largely extended by the method of proof used for III, 
Theorem VIII in S: 
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THEOREM XIII. // the interiors and boundaries of n circles d, whose centers 
are a; and radii r,- , are the loci respectively of m^ unit positive particles, then the 
locus of the center of gravity of these particles is the interior and boundary of the 
circle C whose center is 

miai + mzao + + m n a n 
a. = - 

mi + m-i + + in n 

and whose radius is 



m 



z ( / } - a,-| 



Denote the nii particles in or on d by z{ \ z ^ , , z ( m\, so that 
rg Ti for every i and j . The center of gravity of all the TOI + ; 2 + + m n 
particles is 



_ 



TOI + TO2 + + m n 
so that we have 



+ [(zi n) ) 

z a - 



and hence z is on or within C . 

Conversely, if z is given on or within C , we determine z j } by the relation 

T 

2J on = (z a) &gt; 

and we have the 2y satisfying the proper conditions. The proof is thus 
complete. It may be remarked that when the zj trace their proper circles 
in such a manner that (zj a)/ri is the same for every i and j , then this 
common value is equal to (z a)/r while z traces its circle C . 

Theorem XIII can be extended without difficulty in various directions: 
to particles of negative or even complex mass; to space of any number of 
dimensions; to give a result which shall be invariant under linear transforma 
tion; to regions other than the interiors of circles, especially convex regions. 
In this last extension, use is made of the fact that if m f particles lie in a convex 
region their center of gravity also lies in that region; hence such results as 
III, Theorem IX can be applied. 

There is much more than a mere analogy between Theorems XII and XIII 
for centers of gravity and our previous results concerning the derivatives of 
polynomials. In fact, the only root of the (n -- 1 )st derivative of a poly 
nomial of degree n lies at the center of gravity of the roots of that polynomial. 
When viewed in this light, Theorems XII and XIII are results relating to the 
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location of the roots of the derivatives of a polynomial even if not of the 
jacobian of two binary forms, and are conceived in precisely the same spirit 
as is Theorem IX. Thus the entire discussion of 5 holds practically without 
change if we consider the problem of determining the locus of the roots of the 
fcth derivative of a polynomial of degree n whose roots have certain assigned 
circular regions as their loci. Theorem XIII gives the complete solution of 
that problem for k == n -- 1 if the assigned circular regions are interior to their 
bounding circles. 

As a particular case of Theorem XIII, the fact that if a number of particles 
lie in a convex region their center of gravity also lies in that region follows 
from the theorem of Lucas* as applied successively to the various derivatives 
of a polynomial. 

17. The center of gravity of the roots of the derivative of a rational function 
and of the jacobian of two binary forms. The center of gravity of any set of 
points has interesting properties with reference to that point set. It furnishes, 
for example, an approximate idea of the location of those points. Any line 
through the center of gravity either passes through all the points of the set or 
separates at least two of them.f We shall now find some results connecting 
the centers of gravity of related polynomials of the sort we have been con 
sidering. A classical theorem of this nature follows from a remark previously 
made: 

THEOREM XIV. The center of gravity of the roots of a polynomial coincides 
with the center of gravity of the roots of the derived polynomial. 

We derive the corresponding result for a rational function, which we take 
in the form 

= 

1 



x m + a x m ~ l + 01 a; 7 "- 2 + 


+ m-l 


x n + b x n ~~ l + bi x n ~- + 


+ 6-i : 


O" + b x n " 1 + ) ( i.r "~ l 


4- (i -- l)a .r m - 2 + ) 


~ \ iju |~~ (*-Q (/ 


1 1 T^ 1 ^L [ y) 1 | /),, -&gt;1 - I ... 
y ^ ftX |^ ^ tv J UQ .(. 



,,, , 
; x x ..- 

If we denote by a the center of gravity of the finite roots of / ( .r ) and by /3 
that of the finite poles of /(.r), if m ^ n and if /(.r) has no finite multiple 
poles, we have for the center of gravity of the finite roots of/ (.r) the formula 

(m n -- 1 ) GO + ( m n + 1 ) b 



7 = 



( in -\- n 1 ) ( TO n ) 

m, ( m, n 1 ) a + n ( m n + 1 ) /3 
(m + n - - I) (m -- n) 



* On the other hand, Lemma I (II, p. 102) enables us to use this fact to give immediately 
a very simple proof of the theorem of Lucas. 

f An application of this fact to the more precise location of the roots of algebraic equations 
is given by Laguerre, (Euvres, vol. I, pp. 56, 133. 
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which is a point collinear with a and (3 . If a and j3 coincide, 7 coincides with 
them; if m -- n -\- 1 , 7 coincides with /3; if m == n -- 1 , 7 coincides with a. 
If n = , we have Theorem XIV. If m = n , 7 cannot be expressed in 
terms of merely a and /3 , as is found simply by computing 7 . 

Let us inquire in what respect this work on centers of gravity can be made 
invariant under linear transformation and can be applied to the jacobian of 
two binary forms. 

The concept center of gravity is surely not invariant under linear transforma 
tion. In fact, given any two distinct points of the plane and r? , any third 
point f of the plane can, by a suitable transformation, be made to correspond 
to the center of gravity of the transformed and rj . We need simply to trans 
form to infinity the harmonic conjugate of f with respect to and 77 . 

We cannot expect to obtain results with the ordinary definition of center 
of gravity, so we introduce a new definition. The point G is said to be the 
centroid of a set of points with respect to P if when P is transformed to infinity 
G transformed into the center of gravity of the points corresponding to the 
original set. We suppose that P is not a point of that set. It should be 
noted by way of justification of the definition that the point G is uniquely 
defined, since the center of gravity is invariant under similarity transforma 
tion. The relation between the points P and G is not reciprocal. 

The centroid with respect to a point of a set of points gives a rough indication 
of the distribution of that set of points, like the ordinary center of gravity. 
In particular, if P is external to a circular region containing the set of points, 
G is also in that circular region. In fact, examination of the proof of Lemma 
I (II, p. 102) will show that the force at a point P external to a circular region 
C due to k particles in C is equivalent to the force at P due to k particles 
which coincide at a point Q in C , and Q is the centroid of the k particles with 
respect to P . Thus we are studying the relation between P , Q , and the k 
particles, which is the same as Laguerre s relation set up between those points, 
referred to in 2. 

Let /i and / 2 be two binary forms, of respective degrees p\ and p z , and let 
the point P at infinity be a A-fold root of j\ . Let a, /3, 7 be the centroids 
with respect to P of the roots of /i other than P , the roots of / 2 (all of which 
are supposed finite), the finite roots of the jacobian of fi and / 2 , respectively. 
We easily find that 

( Pl - k) ( k + 1 ) a - (pi - kp z ) fl 
k ( Pl + p -k -I) 

a point collinear with a and 0. If a and /3 coincide, 7 coincides with them; 
if p l = kp z , 7 = a; if pi = k, 7 = /3, which is Theorem XIV. Always we 
shall have* 

* It might seem at first sight that this cross ratio should be pi I k , since by Lemma II 
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which expresses the entire result in invariant form. 

CHAPTER IV: ON THE ROOTS OF THE JACOBIAN OF TWO REAL FORMS 

18. The locus of the roots of the derivatives of a polynomial whose roots 
are real. The present chapter is devoted mainly to general theorems of the 
kind developed in Chapter II, but where we restrict ourselves to ground forms 
whose coefficients are real or can be made real by suitable linear transformation. 
We are placing additional restrictions on our ground forms, so it is to be ex 
pected that some additional properties will appear. 

Any result concerning the location of the roots of the derivative of a poly 
nomial is also a result concerning the roots of the jacobian of two binary forms. 
Thus all the facts proved in A can be given this interpretation and other results 
can be found by linear transformation.* The reader can easily formulate 
these new theorems. We now prove a m-\v result concerning the derivatives 
of polynomials all of whose roots are real. 

THEOREM XV. Let intervals Ii (i == 1, 2, , in ) of the axis of reals, 
whose end points are a,, ft, on = ft, be the respective loci of nii roots of a -poly 
nomial f (z) which has no other roots. Then the locus of the roots of f m (z) is 
composed of a number of intervals I\ k) of the axis of the reals. The left-hand end 
points of the intervals I\ k) are the roots of f (k} (2) when the roots of /(z) are con 
centrated at the points a,-; the right-hand end points are the corresponding roots of 
f m (z) when the roots off (2) are concentrated at the points ft. Any interval 
7j ft) which has no point in common with any other interval Ij k} contains a number 
of roots of f (z) equal to the multiplicity of its left-hand end point as a root of 
/ (t) (2) when the roots of /(z) are the ]&gt;//ints a, . // the internals 7, are all of 
equal length, the intervals I ( t k) are of this same length. If there is a point P which 
is a center of similitude for every pair of the intervals Ii (which is always true if 
m = 2 ) , P is also a center of similitude for every pair of intervals 1^ , 7j* . f 

We prove this theorem under the assumption that no interval I { reduces to 

(II, p. 102) when the pi k finite roots of .A coalesce at a and the p^ finite roots of / 2 
coalesce at ft there is but one position of equilibrium, namely, at the point 7 such that (P, 
a , , 7 ) = pi/k. However, the jacobian vanishes not only at 7 but also at a and j3 if pi fc 
and p 2 are greater than unity. It is the centroid with respect to P of all the finite roots of 
the jacobian that we have denoted by 7. The two formulas are the same when pi k = 1&gt; 
Pa = 1. 

*(Added in proof): There is an error in the statement of the italicized theorem of A, p. 
133, as has been pointed out by Nagy, Jahresberichtder Deutschen Mathe- 
matiker-Vereini gung . vol.31 (1922), pp. 245, 246. That theorem has no meaning 
as it appears at present, but becomes correct if the word exterior is replaced by the word 
other. The theorem is correctly stated in the abstract of A, Bulletin of the Ameri 
can Mathematical Society, vol. 26 (1919-20), p. 259. 

t Some special cases of this theorem are given by Nagy, Jahresbericht der 
Deutschen Mathematiker-Vereinigung, vol. 27 (1918), pp. 37-43; 44-48. 
The special case m = 2 , k = 1 is Theorem II of S. 
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a point; to include this more general case requires merely a slight change in 
phraseology. We prove the theorem first for the case k == 1 . In the theorem 
the intervals are assumed to be finite, but the theorem can be extended to 
include infinite intervals. 

Let us denote by a{* ) the roots of / (A ( z ) when the roots of / ( z ) are con 
centrated at the points on , a( k) i af when i &lt; j , and similarly by ft" the 
roots of f (k) (z) when the roots of /(z) are concentrated at the points /3,, 
ft" ^ $*&gt; when i &lt; j. The intervals /{*&gt; : ({*&gt; , /3j*&gt; ) are then to be proved 
to form the locus of the roots of / w ( z ) . 

Let us start with the roots of / ( z ) concentrated at the points at , and move 
these roots continuously to the right until they reach the points /3; . The 
roots of/ (z) also vary continuously in their totality; they start at the points 
a i and reach the points j3 i . If we number these roots, commencing at the 
left, we can even say that the nth root z n of / ( z ) varies continuously. We 
now prove that z n moves always to the right. 

The equation determining z n is of the form 

(3) F 



SB -" 7l 2 n ~- 72 %n 7m 

where the 7; are the roots of /(z) , coinciding in any multiplicities ?/&,- desired. 
We compute the values 

dF m\ m-i m m 



df (I \2 / \2 / \2 

2 (,Z n -- 7l) (Zn ~ 72 J (Z n ~~ 7m j 

dF_ mj 

dy~i~ (z n-ji) 2 

It is always true that dz n /dyi is positive, so z n always increases with 7; . 

Equation (3) is no longer valid to determine z n if z n is located at a multiple 
root of / ( z ) . Lender these circumstances, if 7* does not coincide with z n , 
the motion of 7, does not change the position of z n . If 7; does coincide with 
z n and if 7, is moved to the right, z n is either unchanged or moved to the right; 
this follows immediately from the fact that a A -fold root of / ( z ) is a (k 1 )- 
fold root of / ( z ) and from the fact that every interval bounded by roots of 
/ ( z ) contains at least one root of / ( z ) . 

From the general fact, then, that the ?ith root z n of/ (z) varies continuously 
and in one sense under the indicated variation in the roots of f (z) , it follows 
that z n traces the entire interval from a n to n . It remains to be shown that 
z n can never be outside of the interval ( a n , @ n ) . If we assume z n to lie outside 
of that interval, say for definiteness to the right, for some possible position of 
the roots of / ( z ) , motion of those roots of / ( z ) to the right always within 
their proper loci would move z n to the right and when the roots of / (2) reached 

Trans. Am. Math. Soc. 5. 
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the ends of their proper intervals z n would lie to the right of P n , which is 
impossible. 

The determination of the locus in Theorem XV is now complete for k = 1 ; 
the statement relative to the number of roots of / (z) in the various intervals 
is readily proved by the continuity methods previously used. 

For the case of k = 2 , the continuity of the motion of the roots of /" ( z ) 
due to the motion of the roots of / ( z ) shows that every point of each of the 
intervals 7" is a root of/" (z) for some/(z) . No other point can be a root 
of/" (z) , for when the roots of /(z) vary continuously in one sense, the roots 
of / (2) and therefore of /" (z) vary continuously in that same sense. The 
number of roots of /"(z) proper to the various intervals is as indicated. 
Continuance of the method of reasoning enables us to determine the locus 
for k = 3 and so on for the other values of k . 

If all the intervals /; are of the same length, the/(z) whose roots are the /3,- 
is obtained from the/(z) whose roots are the a,- by a translation, so the j3,- 
are obtained from the corresponding en by the same translation and the / 
(and hence the I ) are all of the same length as the 7; . If the /?,- are obtained 
from the a,- by a similarity transformation, the (3 are obtained from the 
a{ k} by the same transformation. 

19. The extension of theorems for the derivative of a polynomial to the 
roots of the jacobian. Theorem XV cannot be immediately extended to the 
location of the roots of the jacobian of two binary forms, where the loci of the 
roots of both forms are intervals of the axis of reals. First, all the roots of both 
forms may coincide, so that the locus of the roots of the jacobian is not a 
number of intervals of the axis of reals. Second, the jacobian may have 
non-real roots even when it does not vanish identically.* 

We can avoid this first possibility by requiring that the loci of the roots of /i 
and /2 be so arranged that the two forms cannot be identically equal. We 
can avoid the second possibility by requiring that these loci be so arranged 
that no two roots of /i can separate two roots of /&gt; . Then all the roots of the 
jacobian are real, for on any interval bounded by roots of either form and 
containing no root of the other form there lies at least one root of the jacobian. 

With these new restrictions, Theorem XV extends directly to the jacobian 
of two binary forms. If all the intervals which are the loci of the roots of 
both forms are finite, we consider the en (/8,-) to be at the left-hand (right- 
hand) ends of those intervals which are loci of the roots of /i and at the right- 
hand (left-hand) ends of those intervals which are loci of the roots of / 2 . 
For infinite intervals this notation is reversed. The locus of the roots of the 
jacobian is composed of the intervals whose end points are the corresponding 

* This is shown by the simplest examples, such as/i = z\ z\ , ft = ZiZ 2 , J = 2 ( z\ + z| ) . 
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roots of the jacobian when the roots of the ground forms are respectively the 
on and the j3, . 

A special case of this result is so similar to Theorem II that it deserves to 
be stated explicitly: 

THEOREM XVI. Letfi andf 2 be binary form, ? of degrees pi and p 2 respectively, 
and Jet arcs AI, A 2 , A 3 of a circle C be the respective loci of m roots offi, the 
remaining pi ~ ? roots of f l , and all the roots of / 2 . Suppose A z to have not 
more than one point in common with AI nor with A 2 and no point in common with 
both AI and AI . Denote by A the arc of C which is the locus of points z^ such that 

(21, z 2 . z 3 , z 4 ) = 
m 

when zi , z 2 , z 3 have the respective loci AI, A 2 , A 3 . Then the locus of the roots of 
the jacobian of fi andfz is composed of the arc A 4 together with the arcs AI, A 2 , 
A 3 , except that among the latter the corresponding arc is to be omitted if any of the 
numbers m, p\ - m, p 2 is unity. If an arc A f (i == 1 , 2, 3, 4) has no point 
in common with any other of those arcs ichich is a part of the locus of the jacobian, 
it contains precisely m -- 1 , pi m -- I , pi I , or I of those roots according 
as i = 1 , 2 , 3 , or 4 . 

Theorem XVI, as a special case of our more general result on the location 
of the roots of the jacobian, needs no separate proof, but it is interesting to 
notice that it can be proved in precisely the same manner as Theorem II was 
proved. Theorem I in the proof of Theorem II is replaced by III, Theorem 
IV, and Lemma I (II, p. 102) is replaced by the following 

LEMMA. The force at a point P on a circle C due to k unit positive particles 
lying on an arc A of C not containing P is equivalent to the force at P due to k 
coincident particles lying on A . 

We shall now obtain a result which has some relation to Theorem XVI 
as well as to Jensen s theorem, proved in A. We are dealing with pairs of 
points inverse with respect to a line, and as in A shall term circles whose diam 
eters are the segments joining such pairs of points Jensen circles. Let f\ and 
/ 2 be two real forms which have not necessarily all their roots real. Let finite 
or infinite segments I\ , 7 2 , /a of the axis of reals either contain respectively m 
roots of /i , the remaining p\ -- m. roots of j\ , and the p 2 roots of /2 , or contain 
some of these roots and the intercepts on the axis of reals of the Jensen circles 
of the remainder. Then any real root of the jacobian of /i and / 2 which is 
exterior to /i , I 2 , and 1 3 lies in the interval / 4 which is the locus of the point 
z 4 defined by 

(zi, 3 2 , 23, 24) = 
m 

when /i , 7 2 , h are the respective loci of zi , z 2 , 23 
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To prove this result we require the preceding lemma and the fact that the 
force at a point due to two particles is equivalent to the force at that point 
due to two coincident particles situated at the harmonic conjugate of that 
point with respect to the other two. If a point P is exterior to /; , its harmonic 
conjugate with respect to two points the intersections of whose Jensen circle 
with the axis of reals lie in / , also lies in /; . 

This result is not expressed so as to be invariant under linear transformation, 
for if a real linear transformation is made and if the point at infinity is not 
invariant the Jensen circles are not invariant. 

Our final result on the roots of the jacobian is similarly not invariant under 
linear transformation; it can be proved from the fact proved in A, 2, that 
the force due to two positive particles at a point above the axis of reals but 
interior to their Jensen circle has a component vertically downward; at a 
point above the axis of reals but exterior to the Jensen circle the force has a 
component vertically upward. 

THEOREM XVII. If the forms fi and fa are both real and if fi has no finite 
real root, there is no root of the jacobian of fa and fa exterior to all the Jensen circles 
corresponding to the roots of fa but interior to all the Jensen circles corresponding 
to the roots of f\ . 

20. Conclusion: extension of results to other types of polynomials. We 
have considered in this paper generalizations of Theorem II in various direc 
tions. There is still another direction which we have not mentioned, namely, 
to the roots of polynomials other than the jacobian of two binary forms or the 
derivatives of a polynomial. 

Thus the jacobian of two forms fa and fa , of respective degrees pi and p z , 
all of whose roots are finite and which correspond to two polynomials 0i 
and 02 , has the same roots as the polynomial 

Pi &lt;/&gt;! 02 Pi &lt;t&gt;l 02 

If we set 0i equal to the product of two polynomials \f/i and fa of respective 
degrees m and pi m, Theorem II refers to the roots of the polynomial 

(4) p 2 fa fa 2 + p 2 \[/i 1/4 02 - Pi &lt;Al fa 02 , 

when the roots of \f/i , fa , 2 have the respective loci C\, C z , C 3 . 

We shall generalize Theorem II by considering three polynomials wi , w 2 , w 3 , 
of respective degrees m, n z , Ms, whose roots have the respective loci C 1} C z , 
C 3 . Our conclusion concerns the polynomial 

(5) AI u&gt;i o&gt;2 us + A 2 ui o&gt; 2 w 3 + AS o&gt;i w 2 0*3 , 

where AI , A 2 , A 3 are real* numbers not all zero such that 
AI MI + A 2 M2 + AS jic 3 = . 

* Proof of Theorem I for complex X enables us to remove this restriction of reality. See 
Walsh, Rendiconti del Circolo Matematico di Palermo, vol. 46 (1922), pp. 
236-248. 
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It will be noticed that the polynomial (5) is indeed a generalization of (4), 
and has the additional advantage of being symmetric in wi , coo , cos . 
If a point z is a root of (5) yet exterior to C\ , C 2 , C z , we must have 

, o;i cog a4 n 

AI + A2 -r AS U, 



_j_ 2_2_ , 3j3 = Q 

2 a\ z ct.1 2 0:3 

by Lemma I (II, p. 102), where a\, a z , &lt;x 3 lie in d, C 2 , C 3 respectively. 
Hence z is given by the cross ratio 

/ \ ^3 Ms 

(a : , a 2 , a 3 ,z) = - -- , 

AlMl 

and lies in the region d of Theorem I corresponding to the value 

X = - ^ 3M3 . 
AlMl 

We leave it to the reader to verify that the locus of the roots of (5) is com 
posed of (7 4 together with the regions C\ , C 2 , C* 3 , except that among the latter 
the corresponding region is to be omitted if any of the degrees n\, y., us is 
unity. If a region C ( i = 1 , 2 , 3 , 4) has no point in common with any 
other of those regions which is a part of the locus of the roots of the jacobian, 
it contains precisely /zi -- 1 , nz -- 1 , Ms " 1 &gt; 1 of those roots according as 
i = 1,2,3,4. 

Many of the other theorems of the present paper, such as Theorems VI-XI, 
can similarly be extended to polynomials other than the jacobian of two binary 
forms or the derivative of a polynomial. Modifications of the methods used 
here can be made to apply to a still much broader type of polynomial about 
which the writer hopes to give some further results. 
CATONSVILLE, MD. 



/-CONJUGATE OPERATORS OF AN ABELIAN GROUP* 

BY 

G. A. MILLER 

I. INTRODUCTION 

Two operators of any group G are said to be /-conjugate if they correspond 
in at least one of the possible automorphisms of G. Every characteristic 
subgroup of G includes all the /-conjugates of each of its operators, and if a 
subgroup includes all the /-conjugates of its operators it is characteristic. 
In the present article it will be assumed that G is abelian. As two operators 
of any abelian group are /-conjugate if their prime power constituents have this 
property, and vice versa, it will only be necessary to consider the case when 
the order of G is of the form p m , p being a prime number. Hence this will 
be done in what follows unless the contrary is stated. 

Two fundamental questions in regard to the /-conjugate operators of G are: 
how many sets of /-conjugate operators are there in any abelian group, and 
how many operators are found in each of these sets? Both of these questions 
are answered in what follows, and the method for determining these numbers 
which is developed here seems to be as direct as possible. It is evident that 
every two /-conjugate operators are of the same order, and that a necessary 
and sufficient condition that every two operators of G which are of the same 
order be also /-conjugate is that all the invariants of G be equal to each other. 

Two definitions of independent generators of G are in common use. Accord 
ing to one of these definitions the operators Si, s 2 , , s\ are called a set of 
independent generators of G whenever they satisfy the two conditions that 
they generate G and that no X -- 1 of them generate G. The number X is 
know r n to be an invariant of G. According to the second definition, these X 
operators must satisfy the additional condition that the subgroup generated 
by an arbitrary subset of them have only identity in common with the sub 
group generated by the rest of these operators. According to the first defini 
tion, all the operators of G which do not appear in any one of the possible sets 
of independent generators of G constitute a subgroup of G known as its 
(^-subgroup, while these operators constitute such a subgroup according to the 
second definition when and only when the ratio of the largest invariant to the 
smallest invariant of G does not exceed p . 

* Presented to the Society, December 30, 1920. 
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To distinguish between sets of independent generators satisfying the first, 
or also the second, of these definitions, the latter are called reduced sets of 
independent generators. The former sets of independent generators are usually 
the most convenient when only questions relating to subgroups are considered, 
while the latter are more convenient in the study of conjugacy. In the 
present article it will be assumed that the sets of independent generators under 
consideration are reduced sets unless the contrary is stated. It will be seen 
that all the operators of. G which do not appear in any such set generate a 
subgroup which includes all the independent generators of G except those of 
highest order and those whose order is equal to this highest order divided by p , 
if any of the latter exist. 

"When G has independent generators of different orders, its independent 
generators which are of the same order are evidently /-conjugate and can be 
selected from a set of /-conjugate operators of G which has no operator in 
common with the group generated by the remaining independent generators 
of the set. The latter independent generators can usually be selected in a 
large number of different ways and the subgroups which such operators gener 
ate may differ, but none of the subgroups can involve an operator of the set of 
/-conjugate operators from which the former independent generators must 
be chosen. 

The number of the subgroups of G which are separately generated by all 
the independent generators of G which are of the same fixed order in its 
various possible sets of independent generators can easily be determined. In 
fact, it is the quotient obtained by dividing the number of ways in which the 
independent generators of such a subgroup can be selected from the operators 
of G by the number of ways in which these generators can be selected from the 
operators of one of these subgroups. If p a is the order of such an independent 
generator the totality of the operators of order p , = /3 == a, contained in 
all of these subgroups constitutes a single set of /-conjugate operators of G. 
Hence the distinct operators in all of these subgroups constitute the operators 
of a. /-conjugate sets of G excluding identity. Two such subgroups corre 
sponding to different values of a can have only identity in common, and G is 
the direct product of an arbitrary set of subgroups such that one and only one 
of the subgroups of this set corresponds to a particular possible value of a . 

II. /-REDUCED OPERATORS OF A GROUP 

It has been noted that when G contains a set of independent generators com 
posed of Xi operators of order p a i , X2 operators of order p" 2 , , X 7 operators 
of order p-r , so that 

then the number of its sets of /-conjugate operators which are separately 
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powers of possible independent generators is i + &gt; + + a y , exclusive 
of identity. Each of these sets contains one and only one operator which 
satisfies both of the following conditions: It is the lowest possible power of an 
operator in the set of independent generators Si, s 2 , , s\ which appears in 
the former set, and this operator has the smallest possible subscripts. Such 
an operator will be called an I-reduced operator and hence each of the given 
sets of /-conjugate operators contains one and only one /-reduced operator, 
and this is a power of an independent generator of G . 

In general, an /-reduced operator is defined as the single operator of a set 
of /-conjugate operators of G which satisfies the following conditions: It 
involves powers of the smallest possible number of constituents which are 
separately powers of the operators $1, s 2 , , s\ for the set of /-conjugate 
operators in which it is found, each of these constituents is raised to the lowest 
possible power, and the subscripts of operators of the set s t , s z , s x of which 
these constituents are powers are as small as possible. A necessary and 
sufficient condition that an /-reduced operator involve powers of more than 
one of the operators s\ , s 2 , , *\ is that all these powers be of different orders, 
and that the larger of two generators involved be raised to a higher power of p 
than the smaller, and this power have a larger order than the power of the 
smaller. In particular, no two of these constituents are powers of inde 
pendent generators whose orders have a ratio which is less than i&gt; 2 . 

As each of the possible sets of /-conjugate operators of G is completely 
determined by the /-reduced operator which appears in the set, it results that 
the determination of the number of different sets of /-conjugate operators is 
equivalent to the determination of the possible number of different /-reduced 
operators. It should be noted that the number of /-reduced operators depends 
only upon the orders and the number of the different orders of the independent 
generators of G. That is, if G has more than one independent generator of 
the same order, the number of /-reduced operators of G is the same as that 
of the group having only one of these generators and only one generator whose 
order is equal to the order of every other independent generator of G . 

To determine the number of operators of G which are /-conjugate with a 
given /-reduced operator T of G, it is convenient to call ^-generators all the 
independent generators of G whose orders are equal to the orders of those 
independent generators whose powers appear in this /-reduced operator. 
The remaining independent generators of G will be called s-generators. Let 
P* 1 , P**, , P^ e be the indices, in descending order of magnitude, of the 
various powers of ^-generators which appear as constituents of T , and con 
struct a subgroup of G whose independent generators are powers of s-generators 
which are determined as follows: 

All the ^-generators of G whose orders exceed the order of the largest 
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^-generator are raised to powers such that the common order of these powers is 
equal to that of the p ftl power of this ^-generator, and all the s-generators whose 
orders are smaller than the smallest ^-generator are raised to the p ffe power. 
Each of the other s-generators of G is raised to the highest power whose index 
does not exceed the index of the power to which the next larger ^-generator 
is raised to obtain a constituent of T and whose order is not less than the 
order of the power of the next lower ^-generator which appears in T. The 
powers of the s-generators thus determined constitute a set of independent 
generators of the subgroup in question. 

To obtain all the operators of the set of /-conjugate operators of T , we 
multiply all the operators of the subgroup noted in the preceding paragraph 
by the product of the operators of highest orders in the 6 subgroups which 
are separately generated by the ?/ , p 1 *- , , p^ e powers respectively of the 
/-generators of the same order contained in G. As the invariants of each of 
these 6 subgroups are equal to each other, these powers for any particular 
subgroup are evidently /-conjugate, but the powers for one subgroup are not 
/-conjugate with the powers in question contained in another of these 6 sub 
groups. In particular, the number of operators in each set of /-conjugate 
operators of G besides identity is divisible by p 1 , as results also directly 
from the fact that an automorphism of an abelian group can be obtained by 
letting each operator correspond to any given power of itself whose index is 
prime to the order of the group. 

If the order of an abelian group is not a power of a prime number, the 
number of its sets of /-conjugate operators is evidently the product of the 
numbers of the sets of /-conjugate operators of its Sylow subgroups. In 
particular, it may be desirable to emphasize the theorem: The number of sets 
of I-conjiigate operators in any abelian group is equal to the product of the numbers 
of the I-reduced operators in its Sylow subgroups for a set of independent generators 
in which the order of each generator is a power of a prime number. In this 
theorem, identity is included among the /-reduced operators of a Sylow 
subgroup. 

For the purpose of illustrating the preceding developments, we shall con 
sider the special abelian group of order p w and of type (6, 3, 1). In addition 
to identity, the number of /-reduced operators involving a single constituent 
is 10, the number of those involving two constituents is 11 , and the number 
involving three constituents is 2 . Hence this group involves 24 sets of 
/-conjugate operators including identity. The numbers of /-conjugate 
operators in these 24 sets are as follows: 1 , p -- 1 , p~ - p , p s p 2 , p b p 4 , 
p 7 - p 6 , p w - p 9 , p z - p, p* - p 3 , p 7 - p 6 , p* - p z , (p 2 - p} (p - 1), 
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These numbers illustrate the obvious theorem that a necessary and sufficient 
condition that an operator of an abelian group of order p m , p &gt; 2 , having 
no two invariants which are equal to each other, be either a possible inde 
pendent generator or a power of such a generator is that the number of its 
/-conjugates be not divisible by (p 1 ) 2 . 

This theorem is evidently a special case of the theorem that a necessary and 
sufficient condition that the /-reduced operator of a set of /-conjugate operators 
involve powers of a operators of a set of independent generators of G is that 
the number of operators in this set be divisible by (p -- 1 )" for a general value 
of p . It should be noted that the number of /-conjugate sets of operators of 
a group of order p m depends on the type of this group, but is independent of 
the value of the prime number p, and that the theorem stated at the close 
of the preceding paragraph is not affected by the number of independent 
generators of the same order when G has a general value. For special given 
values of p, the theorem stated at the opening of the present paragraph is 
clearly not always valid. 

III. CRITERIA FOR /-CONJUGATE OPERATORS AND FOR CERTAIN /-CONJUGATE 

SUBGROUPS 

It was noted in the preceding section that there is one and only one 
/-reduced operator in every complete set of /-conjugate operators of the abelian 
group G of order p m , and that the number of constituents in terms of a fixed 
set of independent generators of G appearing in such an /-reduced operator 
can be determined from the number of operators involved in the set of 
/-conjugates to which this /-reduced operator belongs. A necessary and 
sufficient condition that two operators of G be /-conjugate is that they be 
/-conjugate with the same /-reduced operators. We proceed to develop 
another criterion for determining when two operators are /-conjugate. 

The cyclic group generated by an /-reduced operator gives rise to a quotient 
group which is known to be simply isomorphic with a subgroup of G. The 
s-generators of G and all the ^-generators of G with respect to this /-reduced 
operator except those whose powers actually appear in it can also be used as 
independent generators of the quotient group in question. To each of the 
latter ^-generators, except the smallest one, there corresponds a generator of 
this quotient group whose order exceeds the order of all these ^-generators 
whose order is less than that of the ^-generator in question. 

The quotient groups which correspond to two /-conjugate cyclic subgroups 
are evidently of the same type. To prove that, conversely, every two cyclic 
subgroups which give rise to quotient groups of the same type are /-conjugate, 
it should be noted that when these cyclic groups are replaced by those generated 
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by the /-reduced operators in the sets of /-conjugate operators to which their 
generators belong, their largest constituent groups with respect to the set of 
independent generators of G in question must be generated by the same power 
of independent generators of the same order, since, otherwise, in one quotient 
group the number of independent generators, beginning with the largest, 
whose orders coincide with those of G would ( differ from the number of the 
corresponding independent generators of the other quotient group. 

If the generators of the cyclic groups in question involve powers of more 
than one /-generator of G , the second ^-generator involved must again be the 
same for both of these cyclic groups, since the independent generators of the 
quotient group which corresponds to the first ^-generator are of a larger order 
than the second ^-generator, as was noted above. Moreover, the same power 
of this second ^-generator must appear in a generator of each of the two cyclic 
subgroups in question. As this process may be continued until all the 
^-generators whose powers appear in the constituents of the cyclic subgroups 
under consideration have been exhausted, there results the following: 

THEOREM. A necessary and sufficient condition that two operators of any 
abelian group be I-conjugate is that the cyclic groups generated by these operators 
give rise to quotient groups which are of the same type. 

It results directly from the preceding theorem that the number of different 
sets of /-conjugate operators can be determined by counting the number of 
different types of quotient groups to which cyclic subgroups of G give rise. 
For instance, the cyclic subgroups of the abelian group of order p* and of type 
(3, 1) clearly give rise to quotient groups of the following types, and of no 
other types: (3, 1), (3), (2, 1), (2), (1, 1), (1). Hence this group has exactly 
six sets of /-conjugate operators, including identity. The number of operators 
in these sets is 1 , p- - p , p -- I , p (p I ) 2 , p 2 - p , p 4 p 3 , respectively. 
All of these operators are either possible independent generators or powers of 
such generators except those of the fourth set. 

As a first step in a proof of the theorem that if two subgroups of the same 
type give rise to cyclic quotient groups they must be /-conjugate, it will be 
convenient to consider a necessary and sufficient condition that a subgroup H 
of G give rise to a cyclic quotient group. If G/H is cyclic, and if as many as 
possible of the operators of a set of independent generators of G are selected 
from the operators of H , the remaining operators of this set can be chosen 

as follows: 

As one of the operators any operator s\ of lowest order contained in a 

co-set corresponding to any operator of highest order in G/H may be selected. 
A necessary and sufficient condition that Si be the only operator of the set of 
independent generators in question which does not appear in H is that one 
of the operators of smallest order in every co-set corresponding to an operator 
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of G/H be a power of Si . When this condition is not satisfied, find one of the 
largest operators of G/H such that a power of Si is not one of the smallest 
operators in the corresponding co-set. Let s* be any one of the smallest 
operators in such a co-set. It is evident that s 2 may then be chosen as a second 
operator of the set of independent generators in question. 

If the powers of s 2 are not operators of lowest order in the co-sets with 
respect to H in which they appear, we select an operator s 3 of lowest order in 
the co-set which corresponds to the largest operator of G/H to which such an 
operator corresponds but is not an operator of lowest order in the co-set. 
This process is continued until an operator s a is found such that its powers 
are operators of lowest order in all the co-sets with respect to // in which they 
appear. The operators Si , s 2 , , s a are then the operators of the set of 
independent generators in question which do not appear in H . It may be 
noted that the ratio of the order of any one of these operators and the order 
of the one which follows it in this sub-set cannot be less than p 2 . 

For the independent generators of // which are not also independent genera 
tors of G we may choose the operators 



where s f x (x -- 1 , 2, , a -- 1 ) is the inverse of the lowest power of s x 
which appears in a co-set with respect to // in which it is not an operator of 
lowest order, and s p a a is the lowest power of s a which is found in H . The 
order of s f must exceed the order of s x+ i , since the latter is an operator of 
lowest order in the co-set in which the former appears and is an independent 
generator which cannot be replaced by an operator of H . Hence it follows 
that a necessary and sufficient condition that a subgroup G give rise to a cyclic 
quotient group is that the a independent generators (si , s 2 , , s a ) of G which 
cannot be selected from H can be so chosen that 

af and s f*s z+ i (x = 1,2, , a - 1) 



are independent generators of H , lohere s f is of a larger order than s x+ i and 
p x &gt; . It should be noted that each of these independent generators of H 
is of a larger order than any of the independent generators of G whose powers 
appear in the succeeding independent generators of H . 

By means of the theorem of the preceding paragraph, it is easy to find a 
necessary and sufficient condition that two subgroups HI, H 2 of G which give 
rise to cyclic quotient groups be /-conjugate. It is evident that a necessary 
condition is that HI and H 2 be of the same type. To prove that this is also a 
sufficient condition, when it is assumed that both of the quotient groups G/Hi 
and G/H 2 are cyclic, let *i . s 2 , , s p and h , t 2 , , t p be two sets of inde 
pendent generators of G which have been so chosen that as many as possible 
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of these operators are selected from those of HI and HZ respectively. It 
results that the first operators of each of these sets, arranged in the descending 
order of magnitude, whose orders exceed the order of the corresponding reduced 
independent generator of HI and H 2 , respectively, arranged similarly, must 
have the same order. The largest independent generator of H] which is not 
also an independent generator of G has the same order as the largest independ 
ent generator of HI which is not also an independent generator of G , since the 
order of this independent generator must exceed the order of all the other 
independent generators of G which are not also independent generators of HI 
or HI . Hence it results that these independent generators of HI and HZ 
may be regarded as products of powers of independent generators of G which 
are of the same order and independent generators of next to the highest order 
which are found in G but not in the H s . 

Just as the operators of HI and // 2 which correspond to the largest inde 
pendent generator of G which is not also an independent generator of HI or 
HI can be chosen from the operators of G as s^ 1 s 2 was chosen, so the operators 
which correspond to the next to the largest independent generator of G which 
is not also an independent generator of HI or H 2 can be chosen in the same 
way as n f 2 s 3 was chosen, whenever not all the independent generators of G 
save one can be selected from the operators of H\ . Since these arguments 
apply to these successive independent generators, it results that every two 
subgroups of G which are both of the same type and give rise to cyclic quotient 
groups are I-conjugate. 

If HI and HI are two subgroups of the same type which give rise to two 
quotient groups of the same type, it does not necessarily follow that HI and 
H* are /-conjugate, as may be seen by considering the group G of order p 
and of type (5, 3, 1). If si , Sz , s s represent the three generators of G of orders 
p & , p s , and p respectively, and if *? s 2 , s? and Si , *? *3 a re the independent 
generators of HI and // 2 respectively, it results that the two quotient groups 
G/Hi and G/Hz are of type (2, 1), and the two groups HI , HZ are of type (4, 2). 
The latter groups cannot be /-conjugate, since the operators of the highest 
order in the latter are powers of operators of highest order in G , but this 
is not the case as regards the operators of highest order of the former subgroup. 

It may be noted that the sum of the number of independent generators of a 
subgroup of G plus the number of the independent generators of the quotient 
group corresponding to this subgroup is equal to the number of independent 
generators of G whose common order is p increased by a number which may 
vary from the number of independent generators of G whose orders exceed p 
to twice this number, but can have no other value. Both of the limiting values 
can evidently be actually attained, and the fact that this sum can have no 
other values results from the theorem that a quotient group of an abelian 
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group is always simply isomorphic with a subgroup of this group, and that the 
independent generators of a subgroup which gives rise to a cyclic quotient group 
can be selected as noted above. 

It was noted above that when two subgroups of the same type give rise to 
cyclic quotient groups they must be /-conjugate, and when two cyclic sub 
groups give rise to quotient groups of the same type they are also /-conjugate. 
The other extreme cases are when two subgroups of the same type give rise to 
quotient groups of type (1,1,1, ) and when two subgroups of type 
(1,1,1,- ) give rise to quotient groups of the same type. In each of these 
two cases the two subgroups in question are again /-conjugate. In the special 
case when a subgroup gives rise to a quotient group of type ( 1 , 1 , 1 , ) 
which involves as many invariants as G itself, the subgroup is characteristic, 
being the (^-subgroup of G. In this special case, the subgroup is completely 
determined by the type of the quotient group to which it gives rise. 

Every subgroup of G whic h gives rise to a quotient group of type ( 1 , 1 , 1 , 

) must include the (^-subgroup of G . If the ^-quotient group is of order 
p a and a subgroup H gives rise to a quotient group of order p? and of type 
( 1 , 1 , 1 , ) , it results that exactly a j3 of the independent generators 
of G are found in //, while the pth power of each of the other independent 
generators of G is found in this subgroup. From this it results directly that 
if two subgroups of the same type give rise to quotient groups of type (1,1,1, 
) these subgroups must be /-conjugate. The number of the characteristic 
subgroups which give rise to quotient groups of type (1,1,1, ) is evidently 
equal to the number of the different orders of reduced independent generators 
of G . As every operator of order p found in G is a power of a possible inde 
pendent generator of G , it results that when two subgroups of type ( 1 , 1 , 1 , 

) give rise to quotient groups of the same type their independent generators 
can be so chosen that they are powers of independent generators of G which 
are of the same orders. Hence these subgroups are /-conjugate. 

UNIVERSITY OF ILLINOIS, 
URBANA, ILL. 



GENERALIZED LIMITS IN GENERAL ANALYSIS, FIRST PAPER* 

BY 

CHARLES N. MOORE 

The analogies that exist between infinite series and infinite integrals are well 
known and have frequently served to indicate the extension of a theorem or a 
method from one of these domains of investigation to the other. According 
to a principle of generalization that has been formulated by E. H. Moore, the 
presence of such analogies implies the existence of a general theory which 
includes the central features of both the special theories, f It is the purpose 
of the present paper to develop the fundamental principles of that section of 
this general theory which contains as particular instances the theories of 
Cesaro and Holder summability of divergent series and divergent integrals. 
Furthermore, the usefulness of the theory will be illustrated by proving a 
general theorem in it which includes as special cases the Knopp-Schnee-Ford 
theoremf with regard to the equivalence of the Cesaro and Holder means for 
summing divergent series, an analogous theorem due to Landau concerning 
divergent integrals, and a further new theorem with regard to the equivalence 
of certain generalized derivatives. 

The general theorem just mentioned can be extended to the case of multiple 
limits so as to include other new theorems, analogous to those referred to above, 
with regard to multiple series, multiple integrals, and partial derivatives. 
This extension, however, involves formulas that are considerably more com 
plicated than in the case of simple limits. I shall therefore reserve it for a 
second paper, as I wish to avoid algebraic complexity in this first presentation 
of the general theory. 

Following the terminology introduced by E. H. Moore, [| we indicate the 
basis of our general theory as follows : 

(21; ty; @; onSto31 ; ^onitoJi. gonito*. ^ Jonto* :0 ntoS) 

* Presented to the Society, December 28, 1918. 

t Cf. E. H. Moore, Introduction to a Form of General Analysis, The New Haven Mathe 
matical Colloquium, Yale University Press, 1910, p. 1. 

t That the existence of the Holder limit implies the existence and equality of the Cesaro 
limit of the same order was first proved by Knopp; cf. his Inauguraldissertation, Grenzweite 
von Reihen bei der Annaherung an die Konvergenzgrenze, Berlin, 1907. The converse theorem 
was established independently by Schnee and W. B. Ford; cf. M a t he m a t i s c h e 
Annalen, vol. 67 (1909), pp. 110-125, and American Journal of Mathe 
matics, vol. 32 (1910), pp. 315-326. 

SeeLeipzigerBerichte, vol. 65 (1913), pp. 131-138. 

|| Cf. his two papers: On the foundations of the theory of linear integral equations, Bulletin 
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where 21 = [a] denotes the class of all real numbers a, ^ = [p] denotes a class 
of elements p , and @ = [ o- ] denotes a class of sets a of elements p of the range 
jp ; = [ 7 ] , = [n], and g = [ &lt;j&gt; ] are three classes of functions y , rj , and 
respectively on to 21 (we shall restrict ourselves throughout to the con 
sideration of single-valued functions); 4&gt;o is a special function &lt;/&gt; of the class $; 
and J is a function on to and on to g , that is a functional transforma 
tion turning a function of the class into a function of the class or a function 
of the class into a function of the class 5 &gt; denoted by Jj or Jj] . 

In order to make clear the relationship of our general theorem to the two 
special theorems referred to above we will indicate here what the general basis 
reduces to in the particular instances III and I\ : 

$ m s[allnl,2,3, ]; s= [o- n == (1, 2, -,n)n]; 

= == g== [all7, ?,0 on2toJl ]; &lt;fo((r n )==w (); 

(/7)(&lt;O == 7(&lt;M +7(0-2) + + 7(&lt;O (); 

(Ji?) (o- n ) =77(0-1) + -n(vi} + + 7?(0" n ) (n); 

&lt;$ IV == [alia &gt; 0]; - [&lt;r =E (all .r such that &lt; .r ^ a) (a &gt; 0)]; 
= [all functions that are finite and integrable (Lebesgue) on every finite 

interval ( &lt; x = a ) ] ; 



= Tall r, = (j"y 



o")= I 7(7); (Ji))(a)= I 

t/O - 



We next proceed to make certain postulates with regard to the nature of the 
elements in our basis, readily seen to be verified in the specific instances 
indicated. Thus we require the class @ to have the following properties: 

( U) Either corresponding to every a there exists a least common superclass 
of classes a &lt; a , or there exists a CT O such that for every a &gt; cr there 
exists a least common superclass of classes a &lt; a . In both cases the 
least common superclass is itself a a . 

(A) Corresponding to every a there exists a least common subclass of classes 
a &gt; a , this least common subclass being itself a a . 

In the typical instances in view in the formation of this general theory, of 
the two alternatives in ( U) one holds and the other does not hold; however, 
it is not assumed that this disjunction between the two alternatives shall be 

of the American Mathematical Society, vol. 18 (1912), pp. 334-362; On 
the fundamental functional operation of a general theory of linear integral equations, Proceedings 
of the Fifth International Congress of Mathematicians, Cambridge, 1913, pp. 230-255. 
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presupposed. In order to avoid notational reference to these alternatives, 
it is convenient to introduce a property of sets a of @; if the first alternative 
holds, every a of &lt;5 has the property - , in notation &lt;r; if the first alternative 
does not hold, the sets a are the sets a &gt; cr ; further for brevity we introduce 
a property (the negation of ) ; thus every a is a a or a cr . 
We now define 

aL\ = [the least common superclass of classes a &lt; a } (a 1 ) , 

a{ = [the least common subclass of classes a &gt; cr ] ; 

a n+ i = [the least common subclass of classes a &gt; a n ] (a n =- 1 , 2 , ). 
We then postulate 
(R) Corresponding to every a , a{ is a cr and there exists (cr\ )_i = a . 

We next define the notation a &lt; cr" , cr" &gt; cr , to mean that cr" contains 
all the elements of a and at least one element not found in a . We are then 
ready to formulate three limit definitions which are based on the fundamental 
definition of limit in General Analysis given by E. H. Moore.* 

For a given 6 on &lt;S to 21 , a given a , and a given a such that there exists 
a &lt; &lt;r , we shall write 

lim 6 ( cr ) = a 

a ] ff&lt;ff/ 

in the case that, corresponding to an arbitrary positive number e, there 
exists a &lt;r e &lt; a such that for every cr having the property cr e ^ a &lt; a , 
\6(a) - a\&lt; e. 

For any function 6 on to 21 we shall say that 6 ( a ) approaches a limit as to 
cr if corresponding to every positive e we can find a &lt;r e such that for every 
a &gt; cr e we have ( a ) a \ &lt; e . 

For any function 6 on @ to 51 we shall mean by the notation 

lim 6 (cr) = &lt;x 
c 

that for every positive e, there exists a &lt;r e such that for every a &gt; cr e ,6 (a) &gt; e . 
We define the notation 

(7)0)(O = a(cr) 
with regard to every 6 , a on &lt; to 51 , to mean that 

(1) 



&lt;PQ (cr ) 



* Cf. Proceedings of the National Academy of Sciences, vol. 1 (1915), 
pp. 628-632. 
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We require the class to have the linear property 
(L) as defined by E. H. Moore,* 
and the property ( P ) defined by 
( P ) The product 71 (a ) 7-2 (a ) is a function of the class @ . 

It will then follow that the product 71 ( a ) 72 ( a ) y n ( ff ) is a function 
of the class . 

We require the class to have the properties (L) and ( P) and the further 
property of being a subclass of the class , which property we designate as 
S a . We further postulate for the class the property (B) defined by 

(B) If lim f\ (a) exists and is equal to a, then \ri (ff) &lt; ai (ff) 

tr 

We require the class 5 to have the properties (L) and ( P ) and the further 
property of being a subclass of the class , which property we designate as S H . 
Hence J is also on 5 to 5 We also postulate for the class 5 the property 
(A) defined by 

( A ) There exists D&lt;f&gt; = [ ( Z)0 ) ( &lt;j ) ff } , a function of the class . 

We now define 

(2) 0(&lt;7)^&lt;^,(&lt;7_ 1 ) (ff), 0(&lt;r) = (a); (&lt;T) = 4&gt; Ui) (a); 
and with regard to the functions and we postulate 
( F ) All functions and are of the class 5 

We further postulate with regard to the class 5 
( C ) For every 4&gt; and every ff there exists lim 4&gt;(ff) = (V ) . 

ff|(T&lt;ff 

For the operation J we postulate the following properties: 
( Mi ) If Q! &lt; 71 &lt; cr 2 , S 72 , then 



) If for every ff &gt; ff , 0,1 &lt; 71 &lt; 2 , = 72 , then for ff" &gt; a 

(&lt;r )]^ (J[ 7 i 72] ) (&lt;r") - 



( I D ) For every 77 and every ff there exists (D(Ji)))(ff) = rj(ff) , 
(Ij) For every and every a there exists ( J ( D&lt;f&gt; )) (&lt;r) = 0(&lt;r). 
We next introduce for the sake of brevity the following notations: 

(3) 00n(&lt;T) = 0o(0-)-0o(&lt;Tl)-0o(0- 2 ) ..... 0oOn-l) ( rt &gt; 1 ) , 
_ 001 (0") = 00 ( 

* Loc. cit. 
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We then postulate with regard to &lt;o 

(I) 0o is a positive increasing function of a , 

(II) - ( J n 77 ) ( a ) as function of a is of the class $ ( n ) , 

&lt;POn (ff) 

the symbol J n 77 indicating that the operation J has been repeated n times, 

(III) 0o (0"n) as function of a is of the class ^ (n) , 

(IV) Iim0 0n (&lt;r) -oo ( n ), 

a 

(V) [ 0o ( 0"i ) 0o ( a ) ] is constant for all &lt;r , 

(VI) 00n(ff) = 00, n-l(ff) (&gt;!), 1 (ff ) = 00 (ff ) = 1 

We have then as the foundation of our theory: 

y = (W- ffi- (gt/^fl. (UonSto?l-Z,P. ^on Jto?l-iPScS. 

gon^toSl-iPSaCA. ^3 I II III IV V VI. ^*. ^S. 

J" on &lt;5&gt; to *&gt; on &gt;&gt; to 3 JA^z- b 7 j J 

We will now prove that the operation J , when applied to the class , has 
the linear property ( L ) . Let us set 

(Jr, n )(a) = B ((r) (?i = 1,2, -,;). 

It follows from the definition of D and / that 



= ai 771 + o 2 172 + + a ? 
Applying J to both sides and making use of Ij , we have 



= ai(Jrn)(ff) + 02(^2) (o") + + di 

We will now prove two properties of the operations D and J as applied to 
the class 5 We have for a ^ ff 



= 92 



\ \ . / /N , / &gt;. 

00 (ff) - 00(0") 0fl(ff ) - 00(0") 

By virtue of properties (C) and (A) of class ^ the right side of the above 
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equation approaches a limit as to a a &lt; a , for every a . Hence so also does 
the left side, and we obtain the formula 

(4) 



for a = a . In view of (1) and (2) equation (4) obviously holds for a = a . 

By virtue of postulates (A) and (F) and the properties (P) and (Z) of 
the class Jp we may apply the operation J to equation (4). On doing so we 
obtain, in view of Ij and the linear property of J established above, 

(5) (J[0 2 (Ztyi)])(&lt;O =tfi(&lt;O&(&lt;O " (&lt;/[^(A/&gt; 2 )])((7). 

Equation (4) includes as special cases the formulas for differentiation of a 
product and forming the first difference of a product. Equation (5) includes 
as special cases the formulas for integration by parts and partial summation. 

We shall next prove two further properties of the special function . The 
first of these properties is the following: 



(VII) 
We have, in view of the definition of D , ( V ) , ( R ) , ( C ) , and (2), 



\/J/ \ . f \ 

I . |_ (po(ff ) &lt;f&gt;o(ff ) 

= lim [ (f&gt;Q ( (7 ) n 

a I a &lt; a 

We now assume 



Then, making use of (4), we have 

(7) 



)K&lt;o, i-i(cri)] + i(ff) = (i + l)&lt;o;U) (a). 

Hence, if (VII) holds for &lt;? , n = i ( i ^ 2 ) , it will hold for a , n = i + 1 . 
Combining this fact with equation (6), we infer that (VII) holds for a] n ^ 2; 
for cr , n = 1 it is an obvious consequence of the definition of D . For a , n it 
follows at once from the definition of D and (VI). 

We now introduce the following notation: 

(8) *n(^ 



A . 

90, n-l ( &lt;r ) 

The second property of &lt; that we wish to prove may then be stated as follows: 
(VIII) 
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We have from the definition of D, (R) , (C) , and (2) 

1 1 

( D\[/&lt;&gt; ) ( cr ) = ( Z) -j(cr ) = lim 

/ O x \ &lt;/&gt;01/ a\f &lt;&lt;r r &lt;f)o(ff ) - 

(y) 

- 1 - 1 1 

llm .T"/-/^ ,_\==. /_/ x . ,.^~- -TTT-T C^J- 



Then, assuming (VIII) for ?i == i , we have from (4) and (9) 



(10) 



&lt;PO, t+l (0"-l) 

From (9) and (10) the proof by induction of formula (VIII) may readily 
be completed. 

We are now ready to define the two generalized limits with which we shall 
be concerned. Given any function 77 (cr) , we set 

(11) 
(12) 
(13) 

where (f&gt;o n ( a ) is defined as in equation (3) and C and H are used, as is cus 
tomary, to connote Cesaro and Holder. If for a fixed n lim ff ( C n ri ) ( cr ) 
exists, we define this limit as the generalized limit of type ( C n ) for t\ (a ) . If 
lim ff (H n -n)(cr} exists, we define this limit as the generalized limit of type 
(#n) for 77(0-). 

We shall prove the equivalence of these two generalized limits. We begin 
by proving some lemmas. 

LEMMA 1. // we represent by E the identical functional operation E6 = 6(6), 
we have the identity 



where for the sake of uniformity ice have set ( Co 77) (0") = f] (cr) 
We have from the definition of ( C n 77 ) ( cr ) 

F 
, / \ / /-/ \ / \ 

&lt;PO 1 0" ) \ L n f\ ) (. & ) = ~, 

&lt;^0 (""I 

(15) 
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Applying the operation D to this equation, and making use of (4), (VIII), the 
property (I D ) of J , and the property (R) of & , we get 



&lt;Po ( cr ) 

) + : _! 7? 



Applying the operation ./ to this equation, and making use of property (/,) 
and equation (15), we obtain 

)(C n 7,)(a) = - (n - l 



Transposing the first term on the right-hand side and dividing .through by 
??(/&gt;o(&lt;r), we have finally the identity (14) for n &gt; 1. For n = I it is an 
obvious consequence of (11). 

Before stating the next lemma we need to introduce the following notation: 

&lt;t&gt; n (a) = 0o(o-)0o(0-i) 0o(0"-2)0(&lt;0 ( &gt; 2), 
(16) 

&lt;fc&gt; (&lt;r) = (cr)0(&lt;7), &lt;i (&lt;r) = &lt; (&lt;r) . 



LEMMA 2. // lim, ( CT ) e.ti?fe arf w equal to a and &lt;f&gt; ( a ) ] &lt; o-i for every &lt;r , 
then lim ff [&lt;/&gt;o (o") j^ 1 (^&lt;/&gt;) (O wiW eari*&lt; a(? fee equal to a/n and we shall hare 



Oi 

n 



Given a positive e, we choose cr so that a e/l &lt;$(&lt;?) &lt; a + e/4 for 
a &gt; ff f . We have 



(17) 

In view of (16), (3), and (VII) we have the relationship 

(18) 



Making use of (18) and postulates If 2 and Ij , we see that the second term on 
the right-hand side of (17) lies between 

and a+.l- 

n 

We see from (IV) that for a proper choice of a &gt; a- . , each of the above ex 
pressions differs from a/n by a quantity less in absolute value than \e for all 
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The first term on the right side of (17) is seen from (18) and (Mi) to be 
less in absolute value than 

o n ( g 



It follows from (IV) that we can choose a", &gt; a[ so as to make this latter 
expression less in absolute value than \e for a &gt; a" . 

If now we choose for &lt;7 t the greater of a" and a" , it follows from (17) that for 
a &gt; &lt;T C , | [ &lt;on ( o- ) I" 1 ( J&lt;j&gt;n) ( &lt;T ) a/ ii \ &lt; c. The first statement in our con 
clusion is therefore established. We may readily infer that the second state 
ment holds also if we note that in view of (18) and postulates ML and Ij, 
we have 

fit , , \ 1-1 / 7 , \ / \ 1 

n ^ n 

Let us set 

1 . , it 1 1 



(19) (O =-4&gt;(&lt;O 

n n &lt;PO(&lt;T 

We shall then prove 

LEMMA 3. //lim, &lt;/&gt; (cr ) c.mte of7 /&lt;? egwaZ to a and | &lt;/&gt; (a) &lt; ai/or CTen/ cr , 
then \im a $ (cr) wi7/ e.r^ and 6e ega/ to a and we shall have c/&gt; (cr) | &lt; a 2 /or 



We define &lt;f&gt; n (ff} in a manner analogous to that in which 4&gt; n (a) is defined 
by (16). Then multiplying (19) by ?i0o(0 )&lt;o (o"i) ^&gt;o (ffn-s) or by 2&lt;f&gt; (cr) 
according as ?i &gt; 2 or n = 2 , and making use of (VII) and (3), we have 

nti(&lt;r) =&(&lt;r) +[(Do,n-i)(ri)]-[(J0)(r)] (n^2). 

Applying the operation J to this equation and making use of (5), (I D ) , and 
( I , we obtain 



) , we obtain 

&lt;r ) - 



Combining the above equation with (19), we have 
&lt;/,(O =K0 (&lt;0 -- . , , W / M " 

00 (. 0" ) 90 (. 0"! ) 90 ( &lt;fn-l ) 

Applying Lemma 2 we see that the second term of the right side of this equa 
tion approaches -- (n -- 1 )a as a limit and remains finite for all cr. Hence 
our lemma is proved for the case n = 2 . For n = 1 it is an obvious conse 
quence of (19). 
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Let us set 



Noting that S n and M are interchangeable operations, we have, from successive 
applications of (14), 



(77 B i7)(&lt;r) = 



We are now ready to prove the general equivalence theorem: 

THEOREM. // lim,, ( C n rj ) ( &lt;r ) exists and is equal to a , then lim ff ( H n rj ) ( &lt;r ) 

e.risi and fee egwaZ to a, and conversely. 
From the last equation above, property ( B ) , and successive applications of 
Lemma 2 for the case n = 1 , we infer that the existence of lim, (C* n r;)((T) = a 
implies the existence of lim^ ( H n rj ) ( a ) = a , for every n . From the same 
equation, property ( B ) , and successive applications of Lemma 3 we draw 
the converse conclusion. Thus our theorem is established. 

UNIVERSITY OF CINCINNATI, 
CINCINNATI, OHIO. 



ANHARMONIC POLYNOMIAL GENERALIZATIONS OF THE 
NUMBERS OF BERNOULLI AND EULER* 

BY 

E. T. BELL 

We consider twelve infinite systems of polynomials in z which for z =-- 1 
degenerate either to the numbers of Bernoulli or Euler, or to others simply 
dependent upon these. The first part proceeds from the definition of an- 
harmonic polynomials to the specific twelve systems discussed; the second 
presents an adaptation of the symbolic calculus of Blissard and Lucas in 
sufficient detail for rapidly developing a simple isomorphism between the 
algebra of the polynomials and that of the twelve elliptic functions sn , en , 
ns , nc , sc , of Glaisher, and the third contains a short selection from the 
simpler algebraic and congruential relations between the polynomials. Inci 
dentally there is pointed out in the second part a new interpretation of 
Kronecker s work on certain symmetric functions and their connections with 
Bernoulli s numbers. Owing to the length of the paper the development 
stops short of the quadratic transformation of the polynomials which corre 
sponds to the transformation of the second order in elliptic functions, but the 
material given is a necessary foundation for all higher transformations. For 
the same reason only prime moduli are considered in the congruences, although 
the case in which the modulus is a power of a prime can be treated in essentially 
the same way, but at greater length. All references are at the end of the paper. 

I. ANHARMONIC POLYNOMIALS 
1. With each of the substitutions a of the cross ratio group on z, 

1 = (z,z), a = (z, 1/(1 -a) ), & = (z, (z -I)/*), 
7 = (z,l/z), d -- (3,1 -z), e = (*,*/(*" D), 

associate a multiplier a" as follows, 



n being an integer = . From these define six linear substitutions upon the 
coefficients a , cii , , a n , with a n ^ , of the polynomial 

A(z) = o + ai z + a-i z 2 + + a n z n 



* Presented to the Society, San Francisco Section, April 9, 1921. 
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o an z n 



by means of the identities 

a A U) = a" A (a 1 z) = o ff0 + ai z + a, 2 z 2 + 

in which A (a z} denotes the result of applying a to -4(2). Hence a A (z) 
is the polynomial derived from A(z} by first operating with a and then 
multiplying the result by a" . If o = the degree of a A (z) is &lt; n for 
some cr . The six a s thus obtained are 1 , a,ft ,y , 5 ,t , where a corresponds 
to a , a" , etc. 

2. A set of generating relations for the cross ratio group is 

a 3 =-- y 2 -- (a y Y =- I. 
We have aA (z) (z l) n A (a f z) ; whence 

o?A(z) = a(aA(z)) = (z - l)[(z/(l - z) Y A (a 2 z) } , 

a?A(z) = a(c?A(z)) =- (z - 1 )[(!/(- 1 ) ) A ( a 3 ~) } --- A(z). 

Hence a 3 = 1 , and similarly y 2 = 1 , ( ay ) 2 = 1 , so that the a form a group 
F simply isomorphic to the cross ratio group. For convenience of reference 
we reproduce its multiplication table, which is to be read in the usual way; 
thus ay = 8, ye = a, and so on: 

1 a ft y 5 e 



a 


a 





1 


5 





7 


P 


ft 


1 


a 


t 


7 


5 


T 


7 


e 


d 


1 


ft 


a 


6 


d 


7 


e 


a. 


1 


ft 


e 





d 


7 


ft 


a 


1 



3. Obviously a yr = ai n _ r = a n _ r . Let p , a , r be substitutions of F between 
which there is the relation per = r , and write a par for the function of the 
coefficients a r which is derived from the &lt;! in the same way that a pr is from 
the o r . Then a ptrr = a Tr , and the coefficients of the six polynomials are 
related as shown by the table in 2. If y&lt;r = T , then ov = a yyr = a an - T 
Hence, writing as usual ! = 1, (o) (o) == 1, () == rl/(sl(r - 5)!) , we 
have the following forms of the a or , r = , 1 , s , . , n , which explicitly 
define the substitutions of F : 

a\ r -= dynr = H T , 

Oar = 0,_ r = (- l)"- r Z(V)a s , 

.=0 



the a ar being given directly by the a A (z)for&lt;7 = l,o:,j8. 
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4. The set A(z), aA(z), 0A (z) , yA (z) , SA (z) , eA (z) is called an- 
harmonic of degree n , n being the highest degree of any polynomial in the set. 
Any symmetric function of the polynomials A(z), aA(z) is an invariant of 
the subgroup 1, a (&lt;j ---- y , 5, e); any symmetric function of A (z) , aA (z) , 
I3A (z) is an invariant of the subgroup 1, a, j3; and any symmetric function 
of a A (z) (a = l,a,j8,7,S,e)isan invariant of T . 

5. When A(z) is reciprocal, yA (z) == A(z) . Hence from the multiplica 
tion table 5^1 ( z ) = a A ( z ) , eA ( z ) = (3A(z), and in this case the anharmonic 
set reduces to A (z) , aA (z), $A(z). We shall call such a set cyclic, and 
henceforth reserve the term anharmonic for sets that are not cyclic or, what 
is equivalent, contain no reciprocal polynomial. Any symmetric function 
of all the members of a cyclic set is invariant for 1 , a. , @ . 

6. The cr are determined by the coefficients a r as in 3 and are functions of 
the degree n of the set. When necessary to designate the polynomial fixing 
the a we shall speak of the associated cr as the cross ratio substitution for that 
polynomial. To indicate that A(z) belongs to a set of degree n we write 
A(n) (2), enclosing the n in ( ) to distinguish it from the rank defined in 7. 
When it is a question of relations between the coefficients of polynomials of 
different degrees ( 15) the a r may be given double suffixes. Thus 

n n 

A(z) = A (n ) (z) = ^a r z r = ^a nr z r 

1=0 r = 

are merely different notations for one polynomial, and likewise for 

n n 

crA (z) = &lt;rA (n )(z} = ^a ffr 



r=0 



Where there can be no confusion we shall use the simpler forms. 

7. Let x , z be independent variables, and put Vz = k , Vl z = k , 
i = \ 1 . Let t, v be functions of x, k (or of x, z) such that 

t(x, -k) = t(x, k) = -t(-x,k), 
v(x, - k) = v(x, k) = v( - x, k), 

and assume (cf. 12) that these functions can be expanded in absolutely 
convergent power series in x of the form 



We now define T( n ) (z) arising from an odd function generator t ( x , k} to be 
an odd polynomial of rank 2n + 1 , and V( n ) (z) generated by an even function 
v ( x , k) to be an even polynomial of rank In , and write 

T M (z) = T, n+l (z), V M (z) = V, n (z), 
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so that the degree of a set containing a given polynomial is the greatest integer 
in half the rank. This change in notation is essential for the further develop 
ment, as without it the application of the symbolic calculus of Blissard and 
Lucas is impracticable. Exhibiting the ranks rather than the degrees we have 



* ( _ 1 \ 

t(x, *) = E T * n+l ^ s sin 






(*. *o = z ( 4)i F2 (s) = cos HS).*-, 

the trigonometric forms being the purely symbolic equivalents of the series. 

The principles of the symbolic method, which we shall use freely in the 
sequel, having been fully treated by Blissard, Lucas and Glaisher, in the works 
cited, we need not recall them here, except to emphasize the caution that in 
all operations with symbolic powers zero exponents must be included. Thus 
the first term of cos V(z)x is T r o(z), not unity; the symbolic binomial 
( P + q ) 2 = P 2 q + 2p l q 1 + pcf = pi g + 2^ q, + p &lt;/ , not p z + 2;^ q l + q, ; 
and ( p p) 2 = 2(pt po pi) , obtained from (p q ) 2 by putting q = p 
in the final form of the latter. 

8. That a is one of the cross ratio substitutions for A n (z) may be indicated 
by writing &lt;r n , but for simplicity we shall put &lt;r n A n (z) = ffA n (z). Anal 
ogously to the sn , en , ns , notation for elliptic functions we denote each 
of the six polynomials lA n (z) = A n (z), aA n ( z ) , j3A n ( z ) , , eA n ( z ) by 
a double letter symbol I A = A , a A , @A , , eA , and in any such a- A regard 
the a, A as inseparable. In Blissard s method A n (z) is written A n (z) , or 
A n when c is understood, the exponent being purely symbolic, and .4 is called 
an umbra. Similarly we now have (&lt;rA) n = &lt;rA n , since &lt;rA is one symbol, 
and our umbrae are double-letter symbols &lt;rA . It is important to note once 
for all that (a A )" is not a n A n in which a" has the usual meaning as a power 
of a substitution. 

9. We require the operations transforming the generators of T , V into those 
of aT , aV respectively. Let/(.r, A-) denote an arbitrary function of .T, k, 
and fi,- an operator which applied to/ transforms it as follows: 



We regard fi,- as a tripartite operator, 



which replaces kinf(x,k) by Xi (k),x by z4&gt;i (k), and multiplies the function 
thus transformed by &lt;j&gt;i ( k ) . Hence the product 12 j fii in which 12 ; is applied 
first is 
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and if / ( x , k ) is odd in x , even in k , 



= [-9, -*,x) == 



while if / ( x , k) is even in both x and k , 



- x 



Let &lt;r , &lt;r denote any cross ratio substitutions for 7^+1(2), V Zn (z) re 
spectively, so that symbolically 



and designate by s , s operators such that 

s t ( x , k) = sin CT T ( z ) .1* , SB ( a; , k ) = cos crF ( z ) .T . 

Then by inspection of the series, the Greek and Latin letters ( a , a ) , 
( a , a ) , , ( a , s ) , ( a , s ) corresponding, we have 

1 = {1,1,*}, 1 = {1, l,k], 

=={- i/k , ik , 1/k } , a=-- {1, ik , l/k } , 

V -- | - i/k, ik, ik /k] , b -- [I , ik, ik /k} , 

c ---- {I/k, k, l/k}, c ---- {l,k,l/k}, 

d = {-i,i,k \, d =- {l,i,k }, 

e = { l/k , k , ik/k \; e ---- [l,k , ik/k } . 

From the definitions the s form a group G which is simply isomorphic to F 
and whose multiplication table is obtained from that in 2 upon replacing 
each Greek letter by its accented Latin correspondent; the multiplication 
table for the group G of operators s is obtained from G by suppressing accents. 
The explicit forms of the operators given above can be verified easily on com 
bining them according to the formulas developed for the fi,-, noting that the s 
refer to an odd, and the s to an even function. 

10. Thus far s has been applied only to the odd t (x , k ) , and s to the even 
v ( x , k ) . The following cases are of equal importance, and they may be 
seen at once from the definitions. If 

* EE { (*), * (fc),x (*)}, then s= {l,* (fc),x (*)}; 

v(x,k), st(x,k) = -J -8 t(x, k). 

9 
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11. We now specialize the even V in each of two ways and the odd T in 
one way, getting in all for each integral degree n = a system of twelve 
polynomials distributed into two cyclic sets and one anharmonic. Indicating 
that the modulus of the elliptic functions is k by writing sn(.r, k) , etc., we 
define the fundamental polynomials 8 , C , P and their anharmonic transforms 
cr S , &lt;rC , aP by their symbolic generators, 

sn ( x , k ) == sin S ( z ) x , s sn ( x , k ) = sin a S (z)x, 

en ( .v , k ) = cos C ( z ) .1- , s en (x, k) = cos &lt;rC ( z ) x , 

x ns ( .r , k ) = cos P ( z ) . ; sx ns (x, k) = cos &lt;rP ( z ) x . 

From the elements of elliptic functions the coefficients in S 2n +i (z) , Can (z) 
are positive integers, (7 2n (z) is of degree n -- 1 in z, and S^n+i is a reciprocal 
polynomial of degree n in z . Hence P 2 n ( z ) is a reciprocal polynomial, since 
x ns .r = .r/sn x . Therefore, omitting ranks from the notation, the coefficients 
in each of the sets a S(z], aC (z) are integers, those in &lt;rP(z) are rational 
but not integral, and of the three sets a- S(z), crP (z) are cyclic, crC (z) is 
anharmonic. 

12. If for all values of z whose absolute value does not exceed a constant 
different from zero the algebraic relation F(z) = holds, then it is easily 
seen that F ( z ) = is an identity in arbitrary z . Hence in all polynomial 
formulas we consider z arbitrary, it being understood that when F (z) is 
regarded as a coefficient in an infinite series z is such as to render the series 
absolutely convergent (all the series discussed have radius of convergence 
&gt; ) , but that in all other connections z is arbitrary. 

13. For convenience of comparison with Glaisher s grouping of the elliptic 
functions into triads we give the complete set of generators : 

sn (x, k) = sin S (z)x , x ns (x , k) = cos P (z)x , 

en ( x , k) = cos C ( z ) x , x cs ( .r , k ) = cos aP (z)x, 

dn ( .r , k ) = cos yC ( z ) a;; x ds ( .r , k ) = cos j3P ( z ) .v; 

dc ( .r , k ) = cos aC(z\x, cd ( .r , k ) = cos tC ( z ) .r , 

nc (.r, k) = cos dC(z)x, sd (x, k) = sin j3S(z)x, 

sc (x, k) = sin aS(z)x; nd (x, k) = cosfiC(z)x. 

Glaisher points out that of the four triads the second is the most symmetrical, 
and that it should be taken as a basis for the jacobian elliptic functions rather 
than the traditional first. From the present point of view the second triad 
frequently appears to be wholly anomalous: while the sets &lt;r S (z) , &lt;rC (z) in 
many significant ways can be regarded as forming one complete system, the 
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set ffP (z) stands apart in the majority of relations of a specific type. We see 
in a moment that crP (z) is related to the Bernoulli numbers, a S ( z ) , vC ( z ) to 
those of Genocchi and Euler respectively. Hence we have another instance 
of that superficial similarity and radical difference between the numbers of 
Bernoulli and Euler which has often been remarked. 

14. When z = 1 and hence k = 1 the elliptic functions degenerate to circular 
functions of the gudermannian of x . From the values of these as given by 
Cayley, p. 59, and the symbolic generators for the B , E, G , R (numbers of 
Bernoulli, Euler, Genocchi and Lucas) in Lucas 5, p. 262, we find at once 

x ns ( x , 1 ) = ix cot ix = cos 2Bix , 

en ( .r , 1 ) = sec ix = cos Eix , 

dn ( .r , 1 ) = sec ix = cos Eix , 

2x sn (x, 1) = -- 2ix tan ix = cos 2Gix , 

x ds ( .r , 1 ) = ix cosec ix = 2 cos Rix , 

x cs ( x , 1 ) = ix cosec ix = 2 cos Rix . 

The even suffix notation of Lucas is used for all B , E, G , R, and the last 
two are defined b 



= 



- 2 2 " ) B 2n , R 2n -- (1 - 2 2 "- 1 ) B, n . 



We require a fifth system of numbers 7/ 2n +i , the so-called tangent coefficients, 
shown presently to be integers &gt; , 

( _. 1 \n+l . 1 \n 

7/ 2B+1 = i - 2 2 " GW+ 2 = f- 2 2 + ( 2 2 + 2 - 1 ) B 2n+2 . 

n + 1 ?i + l 

Comparing coefficients of like powers of x in the generators above with 
those in 13 we have for n ^ 0, r &gt; (note that in each case the degenerate 
form is expressed as a function of the rank 2n or 2n + 1 ) , 

S 2n+1 ( 1 ) = H 2n+l , P, n ( 1 ) == ( - 1 )" 2 2 B, n -- ( 2i ) 2 B, n , 

a S 2n+l ( 1 ) = ( - 1 ) = - i i 2 +i , aP, n ( 1 ) = 2 ( - 1 )- A. = 2i 2 " R 2n , 
)3S 2n+1 (!) = (-!)= - i i*+i; (3P, n ( 1 ) = 2 ( - 1 )- Rm = 2i 2n R, n 
Cta ( 1 ) = ( - 1 )" E 2n = i 2n E, n , yC 2n (!) = (-!)" E Zn = i 2n E Zn , 
afo(l) = l.aCfrCl) =0, 5C 2B (1) = (- 1)" = i 2 " , 

/3C 2B (1) = (-!)" = *; eCo(l) = 1, ef 2r ( 1 ) =0. 

The coefficients in &lt;S 2 n+i(s), r 2n (z) being integers &gt; 0, so also are H 2n +i, 

("I)" ^2n - 

Trans. Am. Math. Soc. 7. 
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15. An identity between some or all of the twelve elliptic functions implies 
and is implied by the identity between polynomials which is obtained upon 
equating coefficients of like powers of the argument x, and as degenerate 
cases for 2 = 1 of these identities we have relations between the numbers 
B, E, H, R. The elliptic identities may conveniently be segregated into 
classes according to the groups G , G of 9. All the identities arising from a 
given one by successive applications of the operations of G belong to the first 
class, all those similarly derived by means of G belong to the second, and there 
are subsidiary classes corresponding to the cyclic subgroups of orders 3,2. 
Polynomial identities derived from elliptic identities of a given class belong to 
one class. Elliptic identities are further subdivided into types according to 
the degree of the identity in sn , en , , and the derived polynomial relations 
are similarly subdivided, the degenerate cases being included. By means of 
the table in 2 and those next given, together with the formulas of 10, we 
write down immediately from any elliptic identity all others of the same type, 
and hence on replacing the elliptic functions by their symbolic trigonometric 
equivalents from 13 we at once infer all the polynomial relations of one type. 
It is clear that once the elliptic identity is given the rest of the process demands 
very little labor. In the next part we develop the symbolic method proper 
to the subject, and this still further reduces the algebra. 

Consider any one of the polynomial relations. This is an identity in z and 
hence, z being arbitrary, it is equivalent to a set of identical relations between 
the coefficients of the several polynomials. The coefficients of S^ n +i ( z ) are 
it + 1 integers &gt; into which H^n+i is partitioned; those of C-&gt; n (z) are n 
integers &gt; into which ( -- 1 )" Ey n is partitioned, and those of P n (z) are a 
rational but not integral partition of ( -- 1 )" 2 2n B 2n . From the explicit 
values of the transformed coefficients in 3 the coefficients of the transforms 
of the fundamental polynomials S , C , P are known in terms of the foregoing 
partitions, and hence the polynomial relation is equivalent to a theorem con 
cerning partitions of a certain kind of the numbers B , E , H of various ranks. 
Similar remarks apply to congruences between such of the polynomials as take 
integral values when z is an integer. This aspect is not further elaborated 
here, as its complete discussion presupposes a knowledge of the arithmetical 
form of the coefficients in C 2n (z), S 2n+ i(z).* It is not difficult to give 
implicit arithmetical definitions of these coefficients, but this is not what is 
required. 

* The well known method of Hermite (10, pp. 265, 269) for calculating C 2n (2) , Si a+1 (2) 
does not give the required information, as recognized by Hermite himself (12, p. 237). His 
second solution (ibid.), as pointed out by the editors of his works, unfortunately is erroneous, 
and even if it were correct it is difficult to see what the general coefficient would be from the 
forms of those given. Writers on elliptic functions seem to have overlooked Hermite s re 
marks in the second citation, and to have assumed that his first paper is sufficient. 
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10. From the multiplication tables of G , G combined with the results of the 
linear transformations of sn (x,k),cn( x , k ) (as given for example by Glaisher 
(6, p. 120)) we have the following for the generators of the &lt;r S, aP: 

sn sc sd x ns x x cs x x ds x 



1 


sn 


sc 


sd 


1 


x 


ns 


X 


X 


CS X 


x 


ds x 


a 


sc 


sd 


sn 


a 


x 


cs 


X 


X 


ds x 


X 


ns x 


V 


sd 


sn 


sc 


b 


X 


ds 


X 


X 


ns x 


X 


cs x 



the argument being .r and the modulus k . Since each set is cyclic the trans 
formations c , d , e are respectively identical with 1 , a , b . Corresponding 
to 5 we have : any symmetric function of all the members of either of these 
cyclic sets is an invariant of 1 , a , b . 
For the anharmonic set the table is 



en dc nd -dn 



nc 



cd 



1 
a 
b 
c 
d 
e 



en dc nd dn nc cd 
dc nd en nc cd dn 
nd en dc cd dn nc 
dn cd nc en nd de 
ne dn cd dc en nd 
cd nc dn nd dc en 



The modulus and argument in each case are k , x . Any symmetric function 
of the members of the following pairs ( en , dn ) , ( dc , dn ) , ( dn , nd ) is an 
invariant of the group (l,c),(l,&lt;7)or(l,e) respectively; any symmetric 
function of dn , nc , cd is an invariant of ( 1 , a , b ) , and any symmetric function 
of en , dc , nd , dn , nc , cd is an invariant of G . 

17. One example will suffice to show how all the relations of a given type 
may be written down from one of them. When k or k occurs as a factor it is 
replaced in the final result by its z-equivalent. The modulus being k , consider 
sn 2 x + en 2 .r = 1 . From this, since &lt;j&gt; (k) = 1 for each s , 

( s sn x ) 2 + ( s en .r ) 2 = 1 . 

Putting s = a we have, by 10 and the form of a in 9, 

( ik a sn x ) 2 + ( en x ) 2 = 1 ; 

and hence from the tables in 16, k sc 2 x + dc 2 x = 1 . To illustrate 
useful processes we shall also consider in detail the effects of operating with s . 
= W,* ,x }. Then 

s [ sn 2 x + en 2 x] = s 1 ==&lt;/&gt; ( k ) ; 
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- ( s sn ( x , k ) ) 2 + ( s en 



and hence for s = a , & sc 2 x + dc 2 a; = 1 . 

With a little practice the transform can be written down by inspection in 
any case directly from 9, 10, 16. Thus the complete set here is 

( 1 ) , en 2 x + sn 2 x = 1 , 

( a or a ) , dc 2 x k sc 2 x = I , 

(borb ), nd x- Psd 2 .i- == 1, 

( c or c ) , dn 2 x + k 2 sn 2 x = 1 , 

( d or d ) , nc 2 .r sc 2 x = 1 , 

(e or e ) , cd 2 a; -f- // sd 2 x = 1 ; 
whence the final forms 



cos 2 aC (z ) x (1 z) sin 2 aS (2 ) x = 1 , 

cos 2 j8C (z ) .T - z sin 2 $S (z ) .T = 1 , 

cos 2 yC ( z ) x + z sin 2 S ( z ) x = I , 

cos 2 8C ( z ) x sin 2 aS ( z ) .r = 1 , 

cos 2 C (s)a;+ (1 - z) sin 2 0S ( z ) .r == 1. 

Simple rules may be devised for writing down the appropriate multipliers 
such as 2, 1 z above, from the forms of the s , s and 10, for the several 
terms of any symbolic identity when the argument z of the polynomials is 
transformed by the substitutions of T . As these present no difficulty we 
omit them. 

II. ISOMORPHISM WITH ELLIPTIC FUNCTIONS 

18. To find the polynomial relation equivalent to a given elliptic identity 
we evidently must consider the properties of products of t symbolic sines and r 
symbolic cosines in the cases r, t &gt; 0; r&gt;0, t = 0; r - 0, / &gt;0. When 
several symbolic factors of a product are identical we proceed as in the follow 
ing example. Let X, ju denote umbrae (8), and suppose the coefficient of 
x n in cos 2 X.T is required. We write cos 2 X.r = cos Xx cos px , find the coefficient 
of x" in cos Xx cos p,x by actual multiplication of the series for cos X.r , cos /U.T or 
otherwise ( 24), and in the result, after each exponent of X , n has been degraded 
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to a suffix, replace /JL by A . Once more we emphasize that in all expansions 
zero exponents must be included. Thus X, n, , v being umbrse, 



the 23 extending to all r ^ , p ^ , , t g such that 

r + p + + t == n. 

As always, 0! == 1 , and after the completion of all formal operations (multipli 
cations, divisions, etc.) X r fj? v l is to be replaced by \ T /J. P v t By con 
vention, for any particular choice of the signs, 

( X M " ? B ) = XV v = X MO ?o. 

19. All letters A, , TT denote umbra?, and the sets A, A , Z, 

Z = (A, M , ,P),(X ,M / , ,T ),(r,, ,*),(? , , ---.T ) con 
tain respectively r, t,f, p letters. For n = write 

(A|A )"= (X M ... pX /i " r ) n , 

in which all the exponents are symbolic, the summation extends to the 2 r+t 
possible combinations of signs within the parentheses, and the outer sign in 
each case is the product of the signs of X , ,u , , r . The important special 
cases t = , r = give respectively 

(A|) n = E(xM p), 

( A )" = (X M --- T ) n . 

The umbral factors of (A A )" are by definition X, M, , p, X , /* , , T , 
and similarly for the others. If in these the letters be interpreted as ordinary 
quantities and the exponents as algebraic we have precisely the symmetric 
functions considered by Kronecker, cf. 26. 

By definition the respective types of (A|A ) n , (A| ) n , ( A )" are (r t) , 
( r 1 ) , ( 1 &gt; an d the weight of each is n , = the sum of the suffixes in the final 
non-symbolic forms of each. Considered as functions of Xj -, , r k these 
final forms are homogeneous of degrees r + t, r, t respectively, the degree in 
any case being equal to the total number of letters in the bar function ( A | A )" , 
etc. The properties of these functions (A|A ) n , are immediate from the 
expressions in 20 for their symbolic generators. From the principles of the 
symbolic calculus as developed by Blissard and Lucas it is evident that prod 
ucts, etc., of symbolic sines and cosines can be combined formally according 
to the rules of trigonometry, and that the coefficients of like powers of .T in the 
several transforms thus effected of any identity are equal. By starting from 
the elliptic functions of a pure imaginary argument the development can be 
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carried out isomorphically to the theory of the hyperbolic instead of the 
circular functions. This in some respects is preferable, but being committed 
to the other by Lucas trigonometric generators for B , E, G, R n 14 we 
shall not follow it. 

20. Replacing each symbolic sine or cosine by its exponential equivalent 
we see immediately that 
2 r+t cos \x cos fjix cos px sin X x sin p. x sin T x 






= ( - 1 ) 2 sin (A A ) x or ( - 1 )* cos (A A ) .r 



according as t is odd or even; 

2 r cos \x cos jU.1- 
and according as t is odd or even, 



cos px = cos (A ) x ; 



t-\ 



2* sin X .r sin / . sinr .r =(- 1 )~ sin ( A ). r or (-l)- cos(A ).r, 

all of which are included in the first. Hence (A|A ) n is an even function of 
each of the r letters in A, and an odd function of each of the t letters in A ; 
or, (A|A ) n of type (r\t) is r-fold even and /-fold odd. Moreover (A] A )" 
is symmetric in the letters in A , also in those in A . Again, from the gener 
ators, (A|A ) 2n = if f is odd, while (A A ) 2n+1 = if t is even. The corre 
sponding statements for ( A )" , ( | A ) n are included as special cases. 

21. In (A A )" we are concerned with (1) the umbra X, , T ; (2) the 
type ( r \ t) ; (3) the weight n . Each of these has a species of addition theorem. 
Denote by (A, Z) the set consisting of all the letters in A together with all 
those in Z , and similarly for ( A , Z ) . Then ( ( A , Z ) | ( A , Z ) ) n is of type 
(( r +/)|(^ + ^))&gt; and is symmetric in A , Z , also in A , Z . The addition 
theorems with respect to types are given by the following, all of which are 
obvious on remarking that the generator of ( ( A , Z ) | ( A , Z ) ) n is the product 
of the generators of (A|A ) n , (Z\Z ) n , and reapplying the several cases of 
the generators in 20 to these products before equating coefficients of like 
powers of x. Let (n , t, p) == (n , t , p ) mod 2; then 



(0,0,0) 


((A A ),(Z 


Z )|) ( 


(0,1,1) 


(!(A|A ), (2 


Z ))" ( 


(1,0,1) 


((A|A )|(Z 


j 1 
/ / \ 


(1,1,0) 


((Z|Z )1(A 


A ))" ( 



((r|0,(/|p)|) 



((/b)l(HO) . 

For example, when n , t , p are all even, a function of type (("+/) |( + p)) 
is expressible linearly in terms of functions of^types ( r \ t) , (f\p) in the manner 
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shown in the second column. The calculation of such a function is thus 
reduced to a series of symbolic multiplications (and subsequent additions) of 
functions whose types are ( r \ t) , (f\ p ) . 

22. Omitting the addition theorems for the umbrae we restrict the discussion 
of those for the weights to the particularly important cases of two and three 
letters. The general case of n umbrae is treated similarly. In all that follows 
multiplications indicated by dots are purely symbolic. These are performed 
analogously to algebraic multiplications but by the addition of suffixes instead 
of exponents. Thus 

(X r + Mr) (X, + M&lt;) = X r -X( + X r M&lt; + Mr Xj + Mr M* 

= \ r+t + \ M&lt; + Mr X ( + Mr+&lt;; 

X 2 -X 3 = Xs; X 2 -X 2 = X 4 , not Xa . Algebraic multiplication is a special case 
of this. 

Taking first the case of two umbra; X , M we define auxiliary functions { } by 

2{X,M|} r = 4 r + * r , 2{ X, M } s0 -y, 

4&gt;=\ + H, \f/=\ n, r^O, 
and put L r = = L r == {X, M } r , M r == M r == j X, n\ r . Then 

4L 2r ---- ( A , M | ) 2r , 4J/ 2r == ( | X , M ) 2r , 4J/ 2r+1 == ( A | M ) 2r+1 . 
We have &lt;j) r ---- L r + M r , y == L r -- M r , and hence for t ^ 0, 

4&gt; r+t = (L r + M r ).(L t + M t ), t+ -- (L r - M r )-(L t - M t ). 
On the other hand 

2L r+t = &lt;f&gt; r+t + \j/ r+t , 2M r+t = (j&gt; r+t - f +t . 

Whence, substituting for &lt;j&gt; r+t , 4 /r+t an( i degrading exponents, we have the 
addition theorems 

Lr+t =- L r L t + M r M t , M r+t =- L r M t + M r L t . 

For example LI == Xi Mo, L z = X 2 Mo + M2 X , MI == X Mi, M% = 2Xi MI; 
hence L\-Lz = Xs MO + Xi M2 , Mi-M* = 2Xi M2 Again 

L, = |[(X + M) 3 + (X -M) 3 ] = X 3 Mo + 3X lM , , 

so that L 3 = LI- L?. + Mi-Mz as required by the first of the theorems. 

23. The case* of three umbrae X , M , v is treated in the same way, and we need 

* If X , n , v be interpreted as ordinaries and exponents as algebraic, the special cases of the 
addition theorems of the first kind (in 21) for the function]-; ) X , n,v\ } , etc., which are of the 
same form as those for (X, n, "\Y , etc., have interesting, consequences when r is prime for 
Fermat s quotients. Cf. Bachmann, Journal fur Mathematik, vol. 142 (1913), 
pp. 41-50; Dickson, History of the Theory of Numbers, vol. I, p. 111. 
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give only the results. There can be no confusion between a , )3 , 7 , 5 here and 
the substitutions of T , nor between B and the Bernoulli numbers. Write 



- 4{X, 
r = 4{ X, 



} r - a r + /3 r -- Y - 5% 
*} = r - /3 r -- Y + f&gt; r 



a. = X + ju + v , 
ft = X n + v , 

7 = X + fji v , 
= X - M - v , 

and put A r ,B r ,C ,D r ==A T ,B r ,Cr, ]&gt; r We have 

SC 2r+1 = (|X,ju,") 2r+1 , S# 2r (" x 
and the addition theorem for r, i = 0, 
^4 r+J = A r -A t + B r -Bt 

B r+t == Ar B t + B T -A t 

C r +i == A r -L-t ~r D T -Ut 
D r+t =- A r -D t + B r -C t 

There also are multiplication theorems for the calculation of A nr , etc., but 
we shall omit these. 

24. As they are frequently useful we write down the { } equivalents for 2 
and 3 letters of the generators in 20: 



C r A t 

Cr-B t 



D r -D t , 

Dr Ct, 
D r -B t , 



x = cos 



, X 



Xj.v, 



cos X.T cos /j,x = cos { X , 

sin X.T sin /JLX = - cos f |X, //}.* = cos 

cos X.T sin JU.T = sin |X|/xj.i- = sin {ju, X }.r, 

sin X.r cos /i.r = sin {ju|X}x = sin }X,ju|j.r; 

cos X.r cos ju.r cos vx = cos } X , ju , v }.v = cos j { X , jix | } , v \ } .r , 



cos 



X.T cos /z.c sin ra = sin }X, ju i/}a- = sin {y, JX, jtt 



sin X.T sin jua: sin vx = - sin { X, /z, i&gt;}.i- = -- sin [v , { X, JLI} } 



sin Xa- sin /xa; cos vx = - cos {v \, 



= -- cos 



|X, 



, v x. 



25. None of the twelve polynomials can be computed by linear recurrence, 
that is, by an ordinary difference equation of finite order, since otherwise an 
elliptic function would satisfy a linear differential equation of finite order and 
the first degree. We now give two symbolic linear recurrences by means of 
which, in conjunction with the foregoing addition theorems, the actual com 
putation of all the polynomials can be effected systematically without reference 
to elliptic functions. Or, by using the results of 30, 31, the polynomials can 
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be calculated directly from the recurrences. The addition theorems in any 
case shorten the labor but are not indispensable. 

To find the recurrences for L , M of 22 we proceed as if X , M were ordinaries, 
forming the equation in w whose roots are $ , \{/ and multiplying the result by 
w n , finally degrading exponents and replacing ordinary products by dot 
multiplications ( 22). The formal algebraic details parallel those for the 
case of ordinaries, as given for example by Lucas 5, pp. 308-310, and may be 
omitted. For ic = either L or M , and n = 0, we find 

W n+ l 2X 1 +! + (Xo M2) n = 0, 

with the initial values directly from the definitions of L , M , 

Lo = Xo Mo , LI = Xi MO, ^/o = 0, 3/j = X MI 
Thus for n = , w = L , we have 

Lo = 2Xi LI ( Xo M2 ) - L = 2Xi Xi Mo ~ ( X 2 M2 ) Xo MO 

= 2Xo fj. Xo Mo + Xo M2 = Xo Mo + Xo Ms j 
which is correct. Put n = 1: 

Z/3 = 2Xi ( Xo MO + X M2 ) " ( X 2 M2 ) Xi MO = X 3 MO + 3Xl M2 , 

and so on. 

Similarly if w is any one of A , B , C , D , 

U n+4 -- 4Xl"W n+3 + 2(3X 2 M2 ?2 ) H n+2 ~ 4 ( Xa Xl /*2 ~ M2 ?! ) M n+1 

+ ( X 4 + M4 + ^4 2X 2 M2 2M2 v-&gt; 2^2 X 2 ) u\ = 0; 

A = Xo Mo fo, B = 0, 

AI = XiMo^o, BI Xo Mo v\ , 

A-2 = Xo MO ^0 + X M2 ^0 + Xo MO V-2, -62 = 2Xl MO Vl , 

AZ = X 3 Mo VQ + 3Xi M2 VQ + 3Xi MO ^2! B 3 = X Mo v$ + 3X 2 Mo v\ + 3X M2 v\; 

Co = 0, D = 0, 

Ci = 0, DI = XOMI"O, 

C o = 2Xo Mi v \ -^2 = 2Xi Mi ^o . 

G = 6X1 MI "i", -^3 = X M3 ^o + 3X 2 Mi vo + 3X Mi v i 

Provided multiplications be interpreted as above, viz., as dot multiplications, 
the theory of such symbolic recurrences is obviously identical in its formal 
aspects with that of ordinary recurrences. Hence for L , M we have at once 
isomorphs of all the algebraic formulas developed by Lucas for his U , V . 
Thus, for example, L , M are expressible as symbolic continuants, cf. Lucas 4, 
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p. 193. Similarly A, B, C, D have symbolic properties completely 
analogous to the algebraic relations between any system of four independent 
solutions of 

y n+ 4 + py n +3 + qy n+ -i + ry n+ i + sy n = 0; 

in particular there are symbolic equivalents of the generalized continued 
fractions of Jacobi, Fiirstenau, and others. This can be continued in the 
same way for functions of 4, 5, , n umbrae. 

There is a more general aspect of the formulas of this part which is useful 
elsewhere. The development of any function of .r which is even, odd, or 
arbitrary in .r can be written in the symbolic forms 

cos r]jc , sin f.r , cos 77.1- + i sin fur 

respectively, and the coefficient of .r" in the product of any number of such 
functions can be most readily investigated by the symbolic trigonometry 
which we have sketched. When n functions are concerned the symbolic 
recurrences are of order n . 

One special case of interest may be mentioned. As remarked in 19, ( A | ) n , 
( | A )" , when exponents are interpreted as algebraic and the umbrae as ordi 
naries, are the symmetric functions of Kronecker 13, p. 385, which he took as 
his point of departure for deducing certain properties of the Bernoulli numbers. 
The step-by-step symbolic interpretation of Kronecker s analysis may be made 
without difficulty, and all his formulas translated into terms of umbrae in 
stead of ordinaries. The formulas thus derived contain his as limiting cases. 
Bernoulli numbers enter Kronecker s formulas as coefficients in the expansion 
of hyperbolic tangents, and hence the manner in which they appear is quite 
distinct from that of the present discussion. 

26. Corresponding to any product of elliptic functions there is a unique 
product of symbolic sines and cosines, and hence a unique (A|A ) n . From 
any elliptic identity we write down an identity between bar functions of general 
odd or even weight. When the identity contains ns x , cs .r or ds x it is first 
multiplied throughout by the lowest power of x such that these functions can 
be replaced by x ns x , x cs x or x ds x respectively wherever they occur. Now 
the umbrae X, M&gt; , T in (A|A ) n are its umbral factors ( 19), and hence 
the isomorphism between elliptic and polynomial identities is complete. 

III. ALGEBRAIC RELATIONS AND CONGRUENCES 

27. For z =-- 1 the polynomials degenerate to Bernoulli and allied numbers, 
14. Let w denote a complex cube root of unity. The cases z = --co, 
z = \ , which by an obvious analogy may be called the equianharmonic and 
harmonic, are noted here in passing. They are characterized by the vanishing 
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of certain invariants ( 4). Write 
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A n (z) A n (z) Al(z} 
A n (z] Al(z) A n (z) 
Al(z} A n (z) A n (z) 



Then from the table in 2 , 



A n (z) 



aA n (z) aA 



and from the values of a , in 1, it is evident that this vanishes when 

z = a z , z = j3 z , a z = $ z , that is, when z = -- w , co 2 . Hence 

/(- to) = /(- w 2 ) =0, 

and therefore from the tables in 13, 16 we have, when u = - w or w 2 , 
Sln+i (u) + mSi, +1 (u)+ j8S| n+1 ( u ) 

- 3S 2n+ i ( u ) aS, n+1 ( u ) j8S 2n+ i ( M ) = , 
PL+i 00 + aPL+i (u)+ /3PL +1 ( u ) 

- 3P 2B +i ( u ) aP 2n+1 ( M ) /3P 2n+1 ( u ) = , 

C1 B ( U ) + aC| B ( U ) + C 2 3 n ( M ) - 3C 2n ( M ) C 2n ( M ) /3C 2rt ( U ) = , 

7^:1, ( u ) + et l,, ( u ) + 56 L ( u ) - 3 7 C 2rt ( u ) e(7 2n ( 7&lt; ) &C* n ( M) = , 

the fourth of which follows from the third on transforming by 7 , or it is 
independently obvious from the last table in 16. The product of the last 
two is another invariant of the same form which vanishes for the same values 
of u. This invariant corresponds to the entire last table in 16, the deter 
minant of which is a circulant of the sixth order, and hence, by a well known 
theorem of Glaisher, is expressible as a circulant of the third order. From the 
values of a." , j3" in 1 and the form of I n (z) each of these invariants vanishes 
also when z = , 1 . The harmonic invariants are written down in the 
same way. 

28. Applying the first formulas of 24 to the final set in 17 we have, for 
n &gt; 0, 

{C,C\} 2 - ||S,S} 2 = 0, 

\aC,aC\} 2 + (1 -2)1 aS,aS} 2n == 0, 

= 0, 



2n 



{&C,8C 



aS,aS\ 2n = 0, 



[eC,eC\} 2n - (1 -z){\f3S,(3S} 2 = 0, 
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in which C = C (z) , and likewise for the rest. For z = 1 we get from 14 
the corresponding degenerate form, (i = V 1 ) , 

[iE,iE\} 2n - { H,H} 2n = 0, or (- 1 } n \E,E } 2n == { \H , H} 2n , 

from the first or fourth of the above. The remaining degenerate forms are 
identities between powers of i . In full the degenerate relation is 

(--!)"( % ) E 2n -lr E 2r = E ( 2r 2 -"l ) #2n-2r+l //2r-l - 
r=0 r=l 

Similarly for the polynomials P with n &gt; 1 , 

|P, P\} 2 "- !/3P,/3P|i 2 " == 0, [P, P\\ 2n -- [aP,aP } 2n == 0, 

2 2n 2 [B,B } 2n - [R,R\} 2n = 0, 
the first of which corresponds to 

01? ns 2 x .r ds 2 .r = .r 2 A 2 , 

and the second comes from this as shown in 17 by transforming with respect 
to a . The degenerate relation reduces by 14 to 

( i: ) ( 2 2 -- 2 - 1 + 2 2 - 1 - i ) 5 2n _, r s zr = o . 

r=0 

29. It is well known that all the algebra including the addition theorems of 
the elliptic functions follows from the square relations en 2 x + sn 2 x = 1 , 
etc., and the expressions for the derivatives of sn x , en x . Hence all relations 
between the polynomials are implicit in 28 and the formulas next given, 
which are written down from the expressions for the derivatives of sn x , en .r , 
x ns x and their transforms by 16, or directly from the first member of each 
of the three sets as suggested at the end of 17. The first formulas of 24 
are used for all, and the results may be checked at a glance by comparing with 
Glaisher 6, p. 92. Thus, the dot being as in 22, we obtain from 

sn (.r, k ) = sin S (z)x, 
by differentiation, 

en ( .1- , k ) dn ( .1- , k ) = Si ( z ) cos &lt;S ( z ) x , 
or 

cos C(z}xcosyC(z)x = Si(z)- cosS(z)x, 

which gives at once the first of the following: 

S, n+l -- [C,yC\ } 2 ", (2n + \)P 2n+2 - - {P,0P| } 2n+2 , 

aStn+i -- [aC,8C\} 2n , (2n+ 1 ) aP 2n+2 = - {P,j8P| } 2n+2 , 

/3S 2B+1 == [PC, tC\} 2 ; (2n + l)/3P 2n+2 = - [P,aP } 2 "+ 2 ; 

C, n+ , = { 8 , yC } 2 +1 , 7 C Zn+z = z[S,C\ } 2 + , 

- ( 1 - z)\aS, dC\ } 2n+l , 8C 2n+2 = - { a S, aC\ } 2n+l , 
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in all of which n ^ . By means of the L , M addition theorems ( 22) and 
recurrences ( 25) all of the polynomials can be calculated successively, and 
without excessive labor, from the initial values for n = 0, 1 (which are given 
by the definitions). Putting z = 1 as before we find the degenerate forms: 



- - {R,R\} 2 + 2 , 
= - {2B, R\ } 2 "+ 2 ; 



# 2n+ i == (- !){, E\\ 2 \ (2n + l)2 2n B 2 

E 2n+ 2 -- ( - ir +1 \H, iE\ } 2n , (2n + 1) 

whence, by comparison with those in 29, 

- { H, II} 2 " , ( 2 + 1 ) 5 2n+2 = - {B,B\ } 2n+2 . 



30. As a last example of relations involving bar functions of not more than 
two umbrae we take the identities which express the elliptic functions as prod 
ucts of two others, and those between the functions and their reciprocals. 
The argument is 2 , as before. 

S 2n+1 =- 

aS 2n+l -- 

!3S 2n+l = 

P, n -- 

B = {0P,tC\} 2 -- [P,C } 2n , 
n = [P,jC\\ 2n = !aP,aC|! 2 "; 



C 2n = {S,aP 



|2n+l 



+ l)aC 2n = 



.n ---- {8C,yC\} 2 \ 



-- {C,aC\} 2 ", 



(2n 



2n = [aS,P 



5C 2n = 



12n+l 



, C\ ! 2 ; 



whence the degenerate forms, 
H 2n+l = (- !){!, E\}* +1 , 



= [R, 1 } 2 ", 
= {1,2B ! 2 +1 , 



1,R J 2 +i = 0, n &gt; 0, 
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31. The simplest general relations between polynomials of one kind and of 
several ranks are found from the differential equations for the elliptic functions. 
Let X, n, v denote umbrae and consider functions v, t, ic , u of x such that 

v = cos X.T , v" = av + 2bv 3 , 



it- = xu = cos vx , u" = gu + 2/(/r f , 

in which o , b , c , d , g , h are independent of .r , and double accents denote 
second derivatives with respect to x . Then v , t , u are elliptic functions, and 
hence X, /z, v polynomials in z (= the square of the modulus). From the 
differential equations for v, t, we get by substituting the symbolic trigo 
nometric equivalents of v, t, w, performing the differentiations and equating 
coefficients of like powers of .r , 

X 2n+2 + X,, i + 2&iA,X,X|) 2 " = 0, 

Man+s + fM2+i -2d{ M, M, M) 2 " +1 = 0, 

n(2n + l)? 2B+a + (n + l)(2n + l}gv ln - h{v,v,v } 2n+2 = 0, 

in which we have used the last formulas of 24 for the coefficients* in r s , t 3 , u 3 . 
Knowing the values of the constants in the cases \ = C,p = S,v = P from 
Glaisher 6, p. 122, we write down the rest of the following table by the 
methods of 17. The argument is z throughout. 

(X, .,&) = (fj.,c,d)= (v,g,h) = 

(C,2z-l, -z), (S, - l-z,z), (P, -1-z/l), 

(a,-l-z,l), (aS,2 -z,l -z), (aP, 2-3,1), 



(yC,2-z, - 1), 
(6(7, 2z~ 1,1 -a), 
(C, -- 1 -2,2); 

We omit the degenerate forms. The bar functions of X , ju , v may be com 
puted by the addition theorems of 23 and the recurrences of 25, and hence 
each of the polynomials for all ranks can be calculated independently of the 
remaining polynomials. In using 23, 25 we perform all operations for 
three distinct umbrae, putting these equal to one another in the successive 
final steps (see 18). We must pass on to a brief discussion of the congruences 
for prime moduli. 

* The differential equation for w is x- w" = 2.rtr + (gx- 2 ) ir + hu 3 
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32. Henceforth p denotes an odd prime &gt; 0. (Some of the congruences 
are also valid modulo 2, but this case is of such slight interest that we ignore 
it.) The theorem of arithmetic which gives the congruences modulo p is due 
to Lucas 4, pp. 229-230: 



O-GOCi) 



mod p , 



m = m\ p + m i , n == n\ p + n{ , = m( , n( &lt; p . 



We shall in future omit the "mod ;/ in writing congruences, and use "= " 
only in the sense of congruence modulo p . From Lucas theorem it is easy 
to infer that , X , /JL denoting umbra of integers and the significance of the dot 
being as in 22, 

( x M ) rp+s = ( X 71 M" Y ( x n )" , o s s &lt; p , 

the upper signs or the lower being taken throughout. By repeated application 
of this we find the following. Let 

N = r n p n + r n ^ p n ~ l + r n _ 2 p n ~ 2 + + r^h + r 

be the (unique) expression of N in the scale of p , so that =i r } &lt; p ( j = , 1 , 
, n ) . Then 



(X - )*= (X"" - M p ") r "-(X p "- 1 - ju""" 1 ) "- (X" - M P ) ri -(X - M ) r . 
Denote the right-hand members of these congruences by 

n (x p "" j M P "~-T"~ J 

3=0 

respectively, in which the accented II indicates that the products of the 
several factors are to be performed as dot multiplications. Then 



j=0 3=0 

{ lx, M)- V = ifr (x p - j + ^- f y- j - m (x p - j - ^"- j y-&gt;. 

)=0 5=0 

The distributed non-symbolic equivalent of each right-hand member after 
the performance of all dot multiplications and the degradation of all exponents 
is a quadratic form with positive integral coefficients of the form 



r n -i 
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and the term of which this is the coefficient is 



= r n 



That the coefficients are positive is evident from the left-hand members in 
which all coefficients are such; the coefficients on the right are the positive 
residues modulo p of those on the left. 

If p N =-- [\,n } N ,orifp N == { \, /i} v , the above congruences are applicable 
in an obvious manner, equalities of this sort having been considered in the 
preceding sections. From 15 it is clear that any congruence between poly 
nomials (the P set does not immediately enter the discussion since the coeffi 
cients are not integers) is equivalent to a system of congruences between the 
linear functions of the form in 3 into which //, E and their anharmonic 
transforms (the degenerate forms of the corresponding polynomials) for 
different ranks are partitioned. 

Two important special cases of the general formula in this section are 



Hence in all cases we have the residue of a bar function of two integral umbrae 
when the rank of the function is equal to or greater than the modulus. The 
case of two umbras will be completed for a prime modulus when we find, as 
next, the residue when the rank is less than the modulus. 

33. Restating another theorem of Lucas 4, p. 229, in a form adapted 
to our purpose we have 



( &lt; r &lt; p , n &lt; p r ) 
Hence for r &gt; , 

( X + M )"" = ( - 1 ) 



_"^{r + s-l\ 
= 2J ) 

i=0 \ s J 



_ -. 

(X - ft)*- = 2J Xp_ r _ sJ u s ; 



and hence, including all cases (cf . 24), for h =: , 

(p-2ft-l)/2 797 I 9 

*- 1 ^ 



(p-2ft-l)/2 797 I 9 \ 

^ E ( ) 

s=0 \ *~ s / 



x P - 2ft _ 28 _ 1M 2 S5 



s=0 
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&lt;"-^&gt;&lt; 2 / 2h + 2s- IV 

I I A, 1*}^ } Xp-2/,-2, 02.-1 

s=0 \ A* -- 1 / 

34. The congruences in 33 admit of immediate generalization to any 
number m of umbrae X , p , , ju . It is easily seen in the same way, or as a 
consequence of the fundamental formula of 33, 

(x + M )*-.= (X" + M )r.(x + M ) 3 , 

that we have 

(X+p + -.. +M) rP+S - (X"+p + - + M P) .(X+p + +M ).; 

and hence in the previous notation, 

(x + p + + M)- Y = E (x p " y + P""" + + M""~T"- , 

i=Q 

with 2 m -- 1 similar congruences in which the signs of some or all of X , p , , /JL 
are changed throughout. From these the congruences for the general bar 
functions of 19 follow at once. The particular case m == 3 is of interest in 
connection with the formulas of 32, but the length of this paper precludes 
further discussion. From the general formulas given there is no difficulty in 
writing down the congruences for bar functions of two or three letters from 
the algebraic relations developed earlier in the paper. For z =- 1 some of the 
degenerate cases are well known, and hence provide checks. The other 
degenerate cases may be checked directly in a similar manner. 
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NEW PROPERTIES OF ALL REAL FUNCTIONS* 

BY 

HENRY BLUMBERG 

INTRODUCTION 

In a former paperf the author communicated a number of properties of 
every real function / ( x ) ; these were stated in terms of the successive saltus 
functions associated with a given function. The present paper makes no use 
of the saltus functions, and the new properties are direct qualifications of 
/(a;). Since /(.r) is entirely unrestricted, except, of course, that it is 
denned | and therefore finite for every real x these qualifications are con 
sequences of nothing else than that /(a;) is a function. A new light is thus 
thrown upon the nature of a function. 

The new properties are of two types, descriptive and metric; the former 
are concerned with density, and the latter with measure (Lebesgue). 

For the sake of greater concreteness of exposition, we shall discuss, for the 
most part, planar sets and real functions of two real variables. 

1. DESCRIPTIVE PROPERTIES 

We shall say that a planar set S is an I-region ( = open set) if every point of 
S is an inner point of S; i.e., no point of S is the limit of a sequence of points 
not in S . 

We shall deal with binary relations 9i between /-regions and points. /9JP 
shall mean that the /-region / has the relation 9? to the point P . The relation 
9? is said to be closed, if the relationships /9?P n and linin^.^ P n =-- P imply /9?P . 
By a neighborhood of a point P , we understand an /-region containing P; by 
a partial neighborhood of P, an /-region of which P is an inner or boundary 
point. A neighborhood of P is, therefore, also a partial neighborhood of P . 
We have the following 

LEMMA I. // 9? is a closed relation, then the points for which (a) NSIP for 
every neighborhood N of P , and (b) a partial neighborhood N &lt; exists such that 
N &lt; SRP (i.e., N &lt; 9?P is false) constitute a non-dense (i.e., nowhere dense) set M . 

* Presented to the Society March 29, 1919, and December 30, 1920. 

t Certain general properties of functions. Annals of Mathematics, vol. 18 (1917), 
p. 147. 

| Even this restriction may be partially dispensed with; cf. Section 2. 
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Proof: Since 9v is closed and N&lt; 9fP, it follows that no sequence of points P n 
exists such that lim P n =-- P and N &lt; 9vP for every n . Hence a neighbor- 
hood N of P exists such that N &lt; MA for every point A of N . In particular 
N&lt; MB for every point B of NN &lt; , the set of points common to N and .V&lt; . 
Since N &lt; is a neighborhood of B , it follows that no B belongs to M . Every 
neighborhood of a point P of M thus contains a subset, which is an /-region, 
every point of which is outside of M . 

Now let 2 = f(x, y} be any given real function of the two real variables 
x and y , defined for every point of the plane II, which we take to be the AT 
plane of a cartesian system of coordinates. We shall write also z =f(P), 
where P == ( .v , y ) ranges over II . For every pair of real numbers ( .r , y ) , 
there is a single number /(.r, y) ; otherwise/ is unrestricted. We define as 
follows the relation 9? ri r 2 , where ri and r 2 are two real numbers and r\ &lt; r 2 : 
// P is a point and I an I-region of U , then I$R Tl r 2 P , if and only if an infinite 
sequence of points P n of I exists, such that 

lim P n = P , lim f(P n ] exists, 
and 

(P B ) =ir 2 . 



It follows that 9? riri is closed. For suppose that lim P M --= P and IM^ P M 
for every n . Then, for every n , there exists a sequence 



of points of 7 such that lim m _^ P == P (n) and n ^ Iim m _ 00 /(P2 ) ) ^ r 2 . 
From the Pm } , we may, in view of the preceding relations, select a sequence of 
points { Q n } , n = 1 , 2 , , such that 

lim Q n = P and r : g lim inf / ( Q n ) S lim sup /((?) ^ r 2 ; 

and from the sequence {Q n } a subsequence {P n } such that lim/(P n ) exists. 
We then have 

lim P n ---- P and n ^ lim/(P n ) &lt; r 2 . 

9J ri r 2 is therefore closed. 

By the use of Lemma I, we may thus conclude that for every function 
f(x, y) and every pair of numbers ri &lt; r, the points P of the AT plane for 



which (a) NVtr^ P for every neighborhood N of P and (6) A r &lt; 3f ri ,. 2 P for 
some partial neighborhood 7V&lt; of P constitute a non-dense set T fl r, Let 
T be the sum of all the sets T ri r 2 , ri and r 2 ( r\ &lt; r 2 ) ranging independently 
over all the rational numbers. T , being the sum of a denumerable number 
of non-dense sets, is exhaustible* (i.e., of first category according to Baire). 

* For the terminology cf. Denjoy, Journal de Mathematiques, ser. 7, vol. 1 
(1915), pp. 122-125. 
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We shall now use this property of T to obtain a property of all functions. 
For this purpose, we introduce the notion of dense approach. The function 
/ ( x , y) is said to be densely approached at the point ( , ij ) , or in other words, 
the point ( , 77 , $ ) , &lt;; = / ( , 77 ) , of the "surface " z = / ( .T , y ) is said to be 
densely approached, if for every positive e there exists a planar neighborhood 
N of ( , 77), such that the points (x , y) of N for which f(x, y) 
~~/( *? ) I &lt; ( form a dense set in N . 

We have the following 

THEOREM I. For every real function f(x, y) u hatsoever, the points of the 
surface z = f(x,y) that are densely approached form a residual set (= com 
plement of an exhaustible set). Conversely, given any residual set R whatsoever, 
a function f ( x , y) exists that is densely approached at and only at the points of R . 

Proof.* The points of II i.e., of the surface z =f(x,y) are either 
isolated or not. The isolated points of II form a denumerable set; therefore 
the points of II whose corresponding points of II are isolated form a denumer 
able and therefore an exhaustible set. It is consequently sufficient to prove 
that the points P of II , for which P is a limit point of II but is not densely 
approached by II , form an exhaustible set. Since P is not densely ap 
proached, there exists an e such that for every neighborhood N of P , the points 
Q of N for which |/(Q) /(P)| &lt; e are not dense in N; i.e., an /-region 
exists in every neighborhood N of P such that / ( Q ) / ( P ) | == e for every 
point Q of the /-region. Hence there exists a partial neighborhood N &lt; of P 
such that the inequality | / ( P ) f(Q.)\= t holds for every point Q of N &lt; . 
Let now ri be a rational number between / ( P ) e and / ( P ) ; and r 2 a 
rational number between / ( P ) and / ( P ) + f Then it follows from the 
inequality |/(&lt;?) -/(P) S e for all points Q of N &lt; that the set [Q } QtaN&lt; 
i.e., the set of points of II corresponding to the points Q of A T &lt; has no 
point ( , 77 , z ) as a limit with n ^ z ^ r 2 , where ( , r/ ) = = P. Hence 
N &lt; 3? ri ,. a P . On the other hand, P is a limit point of II and hence of every 
N where N is a neighborhood of P; hence N has (, 77, f), with n &lt; f 
= / ( s , t\ ) &lt; r 2 , as a limit point, and P is therefore a point of the non-dense 
set Tr ir , associated with the closed relation 9? Tl r 2 , and hence of the exhaustible 
set T which is the sum of all the 7V ir ,, ri and r 2 ranging independently over 
all the rational numbers. The points of II that are not densely approached 
thus constitute an exhaustible set. 

The proof of the converse is immediate. For let E be the exhaustible set 
complementary to the given residual set R . We may write 

E = Ei + E z + E n + 

* If S is any subset of n , we shall understand by S the set of points of the surface 
z =f(x, y) corresponding to the points of S. Thus n represents the totality of surface 
points. Analogously, P will represent the surface point corresponding to P . 
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where the E n s are non-dense in II and no pair of E n s have common points. 
Let 

f(P) =-, if Pis in E n , 
n 

and 

/(P) = 2, if Pis in R. 

It is then clear that / ( P ) is densely approached, if and only if P is in R . 

Theorem I shows what a remarkable degree of "regularity" every function 
possesses. This may be better realized, perhaps, by using the following 
equivalent definition of dense approach. The function /( x , y) is said to be 
densely approached at P, if for every partial neighborhood N &lt; of P, A T &lt;- has 
P as a limit. We thus get the following equivalent theorem we omit a 
restatement of the converse which shows a kind of "microscopic symmetry" 
in the structure of the surface z = f(x, y) for unconditioned /. 

THEOREM I . With every function f (x , y) whatsoever, there is associated a 
residual set R dependent on f of the XY plane such that if P = (, 77) is a 
point of R , and N &lt; a partial neighborhood of P , the set N &lt; , which consists of the 
points of the surface z = f ( x , y} that correspond to the points of N &lt; , has ( , 77 , 
/(; 7 ?)) as a limit point. 

We pass now to further discriminations in the manner of "approach." 
The function / is said to be inexhaustibly approached at the point P of II , or 
in other words, P is inexhaustibly approached by II , if every neighborhood 
of P contains, for every e &gt; 0, an inexhaustible set of points i.e., a set that 
is not exhaustible, at which / differs from / ( P ) by less than e . 

The function / is said to be exhaustibly approached we then say also that 
the point P = ( .r , y , f ( x , y)) is exhaustibly approached by II at the 
point P = (x, y) , if it is not inexhaustibly approached at the point; in other 
words, if a neighborhood N of (x , y) and a number e &gt; exist, such that the 
points of N where / differs from f(x,y} by less than e form an exhaustible set. 
If M is any planar set, we shall use, in connection with approach, the expression 
"via M" to indicate that (x, y) is restricted to range in J/. Thus "/ is 
inexhaustibly approached at P via M" means that for every neighborhood N 
of P and every e &gt; , the set M N , which is the aggregate of points common 
to M and to N , contains an inexhaustible set of points at which / differs from 
/(P) by less than e. The following definition for exhaustible approach is 
equivalent to the one above: The point P of II is exhaustibly approached, if 
a sphere S exists with P as center such that the points of II corresponding 
to the points of Tl S form an exhaustible set. 

We have the following 

THEOREM II. For every function f(x, y) , there exists in the XY plane a 
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residual set R, dependent on f , such that if P is a point of R, and N &lt; a partial 
neighborhood of P, the function f is inexhaiistibly, and therefore densely, ap 
proached * at P via RN &lt; . 

Proof. We first note that the points of IT where f is exhaustibly approached 
form an exhaustible set E\ . For space contains a denumerable dense set 

A = {Ji, A 2 , , A m , ! . 

Associate with each A m a sequence of spheres S mn , n == 1 , 2, , with A m 
as center and l/ as radius. Blacken every sphere S mn the interior as well 
as the boundary which is such that the totality of points of II contained in 
it corresponds to an exhaustible subset of II . Let E\ be the subset of II con 
stituted by the points which correspond to points of II that lie in one or more 
black spheres. Since the number of black spheres is at most denumerable and 
each black sphere contributes an exhaustible set to E\ , it follows that EI is 
exhaustible. If P is a point at which / is exhaustibly approached, P is in 
at least one black sphere; hence P belongs to EI. 

Now suppose we remove from II the set EI of points at which/ is exhaustibly 
approached, thus obtaining the set #1 == II -- EI of points where / is inex 
haustibly approached. Since the values of / in an exhaustible set cannot 
affect the property of inexhaustible approach, every point of R\ is inexhaustibly 
approached via RI . Furthermore the points of RI , at which / is densely 
approached via RI , form a residual set of II . For assume for the present that 
/ is bounded, we shall later drop this restriction. Let k be any number 
greater than the least upper bound of/. Let the function g(x, y) be equal 
to A- at the points of EI and equal to f(x , y) at the points of RI . Since the 
set of points at which g(x, y) is densely approached forms a residual set R g 
of which only an exhaustible subset can lie in EI , there must be a residual set 
of II in RI at the points of which g is densely approached. This dense ap 
proach is clearly valid via RI and for the function/. Let E 2 be the exhaustible 
subset of points of jRi at which / is not densely approached via RI . It will 
now be seen that at every point of the set R = II EI -- E 2 ,f is densely and 
inexhaustibly approached via R, and furthermore that / is inexhaustibly 
approached at every point P of R via N &lt; R, the common part of N &lt; and R, 
where N &lt; is any partial neighborhood of P . Every point of R is inexhaustibly 
approached via RI and hence via R, since an exhaustible set has no effect 
upon inexhaustible approach. Moreover every point P of R is densely ap 
proached via RI; i.e., if A 7 &lt; is a partial neighborhood of P , and e a positive 
number, there exists a point Q of N &lt; RI at which/ differs from/ ( P ) by less than 
e. Since, however, / is inexhaustibly approached at Q via R, and since R 

* f is densely approached at P via M , which implicitly has P as limit, if for every partial 
neighborhood jV&lt; of P such that jV&lt; M has P as limit, the set (JV&lt; M) , i.e., the set of 
surface points corresponding to the points of N&lt; M , has P as a limit. 
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differs from RI by an exhaustible set, it follows that Q is a limit point of R , 
and therefore N &lt; contains a point of R at which / differs from / ( P ) by less 
than e. P is thus densely and inexhaustibly approached via N &lt; R . 
If / is unbounded, we use the transformation 



i + !/(*,)! 

The new function / is bounded, but the properties of dense and of exhaustible 
and of inexhaustible approach are preserved by this transformation. We may 
therefore drop the restriction made above in reference to boundedness. 

THEOREM III. With every function f (x , y) there is associated (not iiu///iir!i/, 
however) a dense set D of the XY plane such thatf (x , y) is contin nous, if ( .r , y ) 
ranges over D . 

Proof. We shall say that a set of points is " e-s paced" (in the plane II) if 
every circle of radius e contains at least one point of the set. Let e\ , e 2 , e 3 , 
be a decreasing sequence of positive numbers with lim e m = . Let K i be 
any set that is everywhere dense in II and at the points of which f(x, y) is 
densely approached via K\; such a set exists according to Theorem II. Since 
KI is everywhere dense in II , we may select in A i a subset 



In, 



which is isolated (i.e., no point of D is a limit point of D) and ei-spaced. We 
enclose each Pi n in a circle Ci n in such a way that (a) no two C\ n s overlap 
for this purpose it is sufficient to make the radius of C\ n less than one half the 
greatest lower bound of the distances from P\ n to the other points of D\; 
and that (b) Ci n contains a dense subset of A~i at every point of which the 
value of/ differs from/ ( P in ) by less than e\ the existence of such a subset for 
a sufficiently small radius of C\ n is guaranteed by the fact that / is densely 
approached at P Jre via KI. Denote by Jm the set of all the points in K l 
that are in the interior of C ln and at which /differs from/( P Jn ) by less than e\ , 
and by A 2 the subset of KI constituted by all the points of all the Ji n and the 
points of KI that lie in the interior of II -- " = i Cm At every point Q of A 2 
/ is densely approached via K 2 . For if Q is an interior point of II d , this 
property is obvious. If, however, Q is in d n , let / ( Q ) - / ( P ln ) = l - d , 
8 &gt; . /is densely approached at Q via points of A"i at which / differs from 
/(Q) by less than 5; that is, via points where / differs from /(P ln ) by less 
than ei ; hence via A 2 . Since K 2 is everywhere dense and / is densely ap 
proached via A 2 at every one of its points, we may treat it as we did A\ . We 
select an isolated, e 2 -spaced subset Z) 2 = { P 2 i, P 2 2, , P 2n , } of K 
such that D-i contains every point of Dj . We now enclose each P 2n in a 
circle C 2n in such a way that (a) C 2n contains a dense subset of points of A 2 
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where the value of /differs from/( P 2n ) by less than e 2 ; (6) no two (7 2rt overlap; 
and (c) C 2n lies either entirely within a C ln or entirely outside of all C\ n 
Denote by J 2n the set of all the points of (7 27l A 2 at which / differs from 
/ ( Pin ) by less than e 2 , and by K s the subset of K z constituted by all the 
points of all the J 2n and the points of A 2 , lying neither in the interior nor 
on the boundary of any Cz n The set A~ 3 , like A 2 , is everywhere dense and 
such that at every point of it / is densely approached via A* 3 , and the process 
may therefore be continued. The set 

7") - J P P P ) 

l^TTi ~" ( -* ml j -t ml &gt; &gt; -L mn ? ( 

is an e m -spaced. isolated subset of K m and a superset of T) m -\ The circle 
C mn enclosing P mn (a) contains a dense subset of points of K m where / 
differs from / ( P mn ) by less than e m , (b) has no points in common with ( ,,, 
for p T n and (c) lies entirely outside of all C pq , p &lt; in, if P mn is outside of 
all Cpq for p &lt; in, and entirely inside every C pq , p &lt; in , containing P mn . 

We now define the set D of the theorem to be the sum of the D m s ( m = 1 , 
2 , ) That D is dense follows from the fact that D m is e m -spaced. If 
P mn is a point of D , it lies in every circle of a sequence of circles C mn , 
t m+i, n w , , such that at the points of C m+Pt n m that belong to D the func 
tion / differs from / ( P mn ) by less than e m+p . This proves the asserted 
continuity. 

2. GENERALIZATIONS 

So far we have dealt only with real functions of two real variables. It is 
apparent that the considerations apply equally well to functions of a single 
variable and to functions of n real variables. The results are, however, 
essentially of a still more general character. Without entering upon extreme 
refinements of generalization we may note that the arguments in the preceding 
section are substantially valid for every function / ( P ) , defined in a set @ 
satisfying the following conditions: 

(1) @ is a metric;* that is to say, with every pair of elements P and Q of &lt;S 
there is associated a non-negative, real number PQ (Frlchet s ecart) in such a 
w T ay that if P , Q , and R are any three elements of &lt;S , then 



(a) PQ = QP; 



(b) PQ = 0, when and only when P = Q; and 



(c) PQ + QR^PQ. 
(2) @ is a complete space (vollstandiger Raum);f that is to say, if 



* Cf., for example, Frechet, Sur quelques paints du calcul fonctionnel, Rendiconti 
del Circolo Matematico di Palermo, vol. 22 (1907), p. 1, and Hausdorff, 
Grundziige der Mengenlehre, 1914, p. 211. 

t Hausdorff, loc. cit., p. 315. 
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, P n , } is a "regular" sequence of elements of , in other words, if 
for every e &gt; there exists an integer n e such that P x Pp. &lt; e for X &gt; e 
and ft &gt; n e , there exists a limit element P (i.e., an element P with the property 
, P^P = 0). 



(3) & contains a denumerable subset that is dense in @ . 

(4) @ has no isolated points. 

The reader will have no difficulty in introducing in Section 2 the slight 
modifications that are requisite for making the definitions and the reasoning 
applicable to a metric, complete space with a dense, denumerable subset 
and without isolated points. By way of illustration, we show * that the 
distinction between exhaustible and residual sets may always be made for 
such a space. For suppose M\ , J/o , , M n , are all non-dense subsets 
of @ . Let ei, ez, 3, be a sequence of positive numbers with lim e n = . 
Since MI is non-dense in &lt;3 , its complement MI - - M i contains a point . I , 
which is in the interior of M\ (i.e., which is not a limit point of M\ ) . Let C\ be 
a "sphere" | with center A\ and radius &lt; t\ containing only points of J/i . 
C\ contains in turn an interior point A?, of J/ 2 = - M* . Let C 2 be a 
sphere with center A^ and radius &lt; 2 lying entirely in Ci and containing 
only points of M 2 We thus define the sequence of spheres C n (n = 1,2, ) 
The sequence of points AI, Az, is evidently a regular sequence, since 
AX, A &gt; n, lies in the sphere C n , hence its distance from A n is less than e,, , 
and therefore according to property (1) of the distance between AX and 
Ap (X &gt; n, fj, &gt; n) is less than 2e. Therefore, according to property (2) 
of @ there exists a point A of which is the limit of A n .\ A lies in every 
C n , hence outside of every M n . Therefore 3 cannot be represented as the 
sum of a denumerable set of non-dense sets; that is, is not exhaustible. 
Since the sum of two exhaustible sets is exhaustible, it follows that a residual 
set, which is the complement in of an exhaustible set, is not itself exhaustible. 

Further details of the extension of the ideas of the preceding section to @ 
we leave to the reader. We may state the following 

THEOREM IV. Let &lt;S be any complete, metric space without isolated points and 
containing a dense, denumerable subset; andf(P) any real function defined for 
the elements P of &lt;5 . Then there exists a residual set R such that if P is a point 

* Cf., for example, Hausdorff, loc. cit., chap. VIII, Section 9, especially Theorem VI and 
p. 328. 

t A "sphere" of with center P and radius r is the set of points of @ whose e"cart from P 
is ^r. 

Jit is to be observed that if the radius of C n does not approach 0, no conclusion with 
reference to the existence of a limit point of [A n ] can be made. For this sequence would 
then not necessarily form a regular sequence; there may therefore be no limit point unless &lt; 
were assumed to be compact. In this case, however, function space would not be an instance 
for our results, since a "sphere" of function space, while possessing the properties demanded 
of @ , is not compact. 

In the terminology of E. H. Moore, / is a function on @ to A , where A is the set of real 
numbers. 
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of R, and N &lt; a partial neighborhood of P , the function f is inexhaustibly, and 
therefore densely, approached at P via RN &lt; . Also there exists a dense siibset 
D of such that f (P) is continuous if P ranges over D . 

As particular examples of a complete metric space with a dense denumerable 
subset and without isolated points, we mention: 

(a) Euclidean tt-space where the ecart between two points is the euclidean 
distance between them. 

(6) A perfect subset of a euclidean space. 

(f) Hilbert space, that is, the ensemble of sequences ( .TI , x 2 , , x n , ) 
of real numbers with convergent 5Z"=i x%. The ecart between two "points" 
(a i ,xz, ,%n, ) and (2/1,2/2, , y n , ) is defined to be 



The denumerable set consisting of all the sequences ( r\ , r 2 , , r n , , , , 
) , where the r s are rational numbers, is dense in . Furthermore, is 
evidently complete. For if P , P" , , P (n) , is a regular sequence of 
points of , and P (n) == ( x ( P , x ( ^ , ) , then for e &gt; an integer i exists 
such that 



for p &gt; i and q &gt; i. Hence x^ converges to x n , say; therefore, P ( " to 
(TI &gt; x%, , a n , ) &gt; which, as may be seen, again belongs to Hilbert space. 
, of course, has no isolated points. 

(d) Function space: consists of all real continuous functions /( .r ) defined 
for 5i x = 1 . The ecart between /i ( .r ) and/ 2 ( x ) is defined to be max \ f\(x) 
- / 2 (.r) | ; the postulates for ecart are then satisfied. That there is a dense, 
denumerable subset follows from the theorem of Weierstrass that every con 
tinuous function is the limit of a uniformly convergent sequence of polynomials, 
which may be assumed to have rational coefficients and are thus denumerable. 
Moreover, since the limit of a uniformly convergent sequence of continuous 
functions is continuous, it follows that a regular sequence has a limit, and 
therefore is complete. obviously has no isolated points. 

The assumption that / is single-valued may also be dropped without in 
validating Theorems I and II of the last section; Theorem III, of course, 
implies single-valuedness by its very nature. We thus get the following 
results: 

THEOREM I (a) . Letf(x, y] be any real function defined for the entire XY 
plane and taking at every point at least one value; the number of values may 
change, however, from point to point and vary from 1 to c , the cardinal number of 
the continuum. Then the points (x , y] such that every surface point (x , y, 
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f( x &gt; y)) i s densely approached by the surface z = f(x, y) constitute an ex 
haustible set. 

THEOREM II (a) . If f(x,y) is any real, single- or many-valued function 
defined for the entire XY plane, there exists in the XY plane a residual set R, 
such that if ( x , y ) is any point of R , and N &lt; a partial neighborhood of ( .r , y ) , 
then every point (x, y ,f(x , y)) is inexhaustibly approached via RN&lt; . 

The proof for these more general results is essentially the same as that for 
Theorems I and II and is left to the reader . 

Theorems I (a) and II (o) hold also for any complete, metric space with a 
dense, denumerable subset and without isolated points. 

3. METRIC PROPERTIES 

As in the case of the descriptive properties of Section 1, we shall confine the 
discussion in this section to planar sets and to one-valued functions of two 
variables. 

Let S be any planar set; P , a point of S; C T , a circle (interior and boundary) 
with P as center and r as radius; m ( C r ) , the area of C T , and m e (SC r ) , the 
exterior Lebesgue measure of the portion of S in C r . Then if 

Km ? -^4^- } 

r-*0 111 ( Cr ) 

exists and is equal to I, we shall say that the exterior metric density of S at 
the point P is / . We have the following 

THEOREM V.* Let S be any planar set. Then the points of S at which the 
exterior metric density of S is ^ 1 , i.e., the points ichere the exterior metric 
density either does not exist or does exist and is &lt; 1 , constitute a set of zero measure 
(Lebesgue}. 

Proof. If P is a point at which the exterior metric density of S does not 
exist or exists and 5^ 1 , the 

,. . e m e (SC T ) 
hmmf- \ 

r-&gt;0 711 (Cr) 

must be &lt; 1 . Let Tk be the totality of points of S where 

r . t m e (SC r ) , , 

lim ml - -&lt;*&lt;!. 

r-*0 TO (G r ) 

The set whose measure we are to prove equal to zero is the sum of Ti,* , T 3/i , 
, T(n-i)m , , and since the sum of N sets each of zero measure is again 

* For the case where only measurable sets are admitted, cf . Lebesgue, Legons sur I Integration, 
1904, pp. 124-125; Denjoy, Journal de Mathematiques, ser. 7, vol.1 (1915), 
p. 132; Lusin and Sierpinski, Rendiconti del Circolo Matematico di 
Palermo, vol. 42 (1917), p. 167; de la Vallee Poussin, Cours d Analyse, vol. 2, 1912, p. 114. 
For the linear case of general (not necessarily measurable) sets, cf. Blumberg, Bulletin 
of the American Mathematical Society, vol. 25 (1919), p. 350. 
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of zero measure, it is sufficient to prove that Tk is of zero measure. Let m be 
the exterior measure of Tk so that 2\ may be enclosed, in such a way that 
every point of Tk is an interior point of at least one C n , in a sequence of circles 
C n , n = 1,2, , with the sum of their areas &lt; m + e, where e is arbitrarily 
small. Associate with every point P of Tk a circle C P lying in the interior of 
one of the circles C n , and such that 

m e (C P S) j f 

and, a fortiori, 

( n T. \ 

&lt; k, 



m e (C P T k ) 



m(Cp) 
where 

m ( C P ) = area of C P ; 

this is possible because of the assumed property of the points of Tk From 
the circles C P a denumerable number may be extracted having the same 
totality of interior points, and from this denumerable number, a finite number 
C , C" , , C (K) , that cover Tk except for a set of exterior measure &lt; e . 
We now T select from the C (n) a subset of non-overlapping circles as follows. 
Let D be a C (n) having the maximum area attained by the C M ; remove 
from the set {C M } the circles which intersect D , and let ])" be one of the 
circles left, and having the maximum area for these circles; remove now all 
circles intersecting D" , and let D " be one of the circles now remaining and 
having the maximum area for all circles intersected neither by D nor by D" . 
In this way we obtain a set of non-overlapping circles D M . Let i represent 
the area of the portion of the plane covered by one or more C M , and a 2 the 
portion of the plane covered by the D M ; then it follows from the manner in 
which the D M were chosen that 

1 

9 
Since every Z) (n) is a C M we have 

/ 7~l ( "n~\ fri \ 

!.(/&gt; T k ) ^ ,.. 



and since the D (n) do not overlap, we may cover the subset of Tk that lies in 
the D w by means of circles having a total area &lt; ka . We may thus cover 
Tk by means of circles having as sum of their areas a number less than 
ka-z + ( fli 2 ) + , the first term representing a sufficient amount for the 
portion of Tk in the D (n) , the second term, for the portion of Tk in the C M 
and outside of the D (n) , and the third term, for the portion of Tk outside of 
the C (n) . Therefore, 

m &lt; A-a 2 + (ai a 2 ) + e i ~ (1 k)a z + e; 
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and in virtue of the inequalities 



we have 
whence 



m e &lt; fli &lt; in + e, a z &gt; -GI, 



m &lt; m -f e - ( 1 k ) ( m t ) + 



( 19 - fe ) 



1 7- 

1 n? 

and therefore m = . 

Definition. N &lt; is said to be a non-vanishing partial neighborhood of P, if 
the exterior metric density of N &lt; at P is ^ . 

We have the following lemma which corresponds to Lemma I in the case 
of the descriptive properties. 

LEMMA II. Let $R be a closed relation as in Lemma I. The points P for 
which (a) A T 3?P for every neighborhood of P and (b) a non-vanishing partial 
neighborhood N &lt; exists such that N&lt;WP (i.e., A r &lt; 9tP is false) constitute a set of 
zero measure. 

Proof. Let T be the totality of points P having the properties (a) and (6) . 
We may then show as in the proof of Lemma I that N &lt; 9?Z&gt; for every point B 
of NN&lt; , which, if N&lt; is a non-vanishing neighborhood of P , is also a non- 
vanishing neighborhood of P . Every point P of T thus has a non-vanishing 
neighborhood every point of which does not belong to T; therefore at no 
point of T is the exterior metric density of T equal to 1 ; T is therefore of zero 
measure. 

Let z = f(x, y) be a given surface; P = (x , y) , a point of the AT plane; 
and the relation 9? n r 2 , r\ &lt; r z (as in the case of Section 1), such that 79i ri r 2 P 
when and only when there is a point ( x , y , f ) , ri ^= f : r-&gt; , which is a limit 
of / . It follows as before that 9? ri r 2 is closed. Therefore, by means of 
Lemma II, we may conclude that the points P for which (a) Xlrlr^ P for 
every neighborhood N of P , and (fe) A 7 9? ri r 2 P for some non-vanishing partial 
neighborhood of P constitute a set T ri r 2 of zero measure. Let T be the sum 
of all the sets T ri r 2 , as r\ and r- 2 , rj &lt; r 2 , vary over all the rational numbers; 
T, being the sum of fr$ sets of measure zero, is itself of measure zero. Now a 
point of the surface z = / ( x , y) is either isolated (and then ( x , y ) belongs to 
a denumerable set of points and hence to a certain set of measure zero) or 
else it is a limit point of the surface, and then NSftr^ P for every neighborhood 
N of P , if ri &lt; 2 &lt; r 2 . We therefore have the following 

THEOREM VI. Letf(x, y) be any real, one-valued function defined in the 
entire plane. Then there exists in the AT plane a set Z , dependent on f, of 
measure zero, such that if (a) (x, y) is any point of the XY plane not belonging 
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to Z ; (6) N&lt; , any non-vanishing partial neighborhood of (x , y) ; and (c) S , any 
sphere with ( x , y , f ( x , y ) ) a* center; then there is at least one point of the surface 
z = f(x, y) lying in the sphere S and having as projection upon the XY plane 
a point in N &lt; . 

That the theorem becomes false if we omit the restriction that the partial 
neighborhood N &lt; shall be non-vanishing, is seen from the following example. 
Let A be a planar non-dense perfect set of positive measure, and B its com 
plement. Let f(x, y) = in ^1 and 1 in B. Since A is non-dense and 
closed, B is an everywhere-dense /-region and therefore every point (.r, y) 
of A has as one of its partial neighborhoods a set A T &lt; , consisting exclusively 
of points of B . For such an / ( x , y ) , ( .r , y ) and N &lt; , however, the assertion 
of the theorem is false, although, since A is not of zero measure, there are 
points of A not belonging to the alleged set Z of the theorem. Of course, in 
virtue of Theorem V, the points of A , with the exception of those belonging 
to a set of measure zero, have none but vanishing partial neighborhoods that 
consist entirely of points of B . 

It is obvious, as, indeed, the last example shows, that the points at which a 
function is not densely approached need not be of zero measure. If in passing, 
however, we alter the meaning of "densely approached" in a rather natural 
way, we may conclude, according to Theorem VI, that /is densely approached 
everywhere except at the points of a set of measure zero. The new definition 
of dense approach is as follows : / is said to be densely approached at P if for 
every e &gt; and for every non-vanishing partial neighborhood A r &lt; of P , there 
is in N &lt; a point different from P where / differs from / ( P ) by less than e . 

Let now the relation 9? ri r 2 , ri &lt; r 2 , be such that /9?nr 2 P when and only 
when there is a point ( x , y , f ) , r\ i f =i r 2 , inexhaustibly approached via 7 ; 
that is to say, for every e &gt; and for every circle C of the XY plane with P 
as center, there is an inexhaustible set of points in CI at which / differs from f 
by less than . Then it follows that 9? r ,r 2 is closed. If / is inexhaustibly 
approached at P and a non-vanishing partial neighborhood N &lt; of P exists via 
which / is not inexhaustibly approached at P , then there exist two rational 
numbers r\ and r 2 , ri &lt; / ( P ) &lt; r 2 , such that A 7 9?r 1 r 2 P for every neighbor 
hood N of P and A^&lt; 9? ri r 2 P . Therefore according to Lemma II, P belongs 
to a certain set Z ri r 2 of measure zero associated with the closed relation 9? r ,rj. 
The sum of the Z rir , for all possible rational numbers r\ &lt; r 2 is also of measure 
zero; we thus have 

THEOREM VII. The set of points P of the XY plane, at ichichfis inexhaustibly 
approached and for which a non-vanishing partial neighborhood exists via which 
f is exhaustibly approached at P , constitute a set of measure zero. 

It is, of course, not true that the points where / is exhaustibly approached 
form a set of measure zero. Because of this fact the result for inexhaustible 
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approach is not as pleasing as that for the case we shall consider next. We 
shall say that / is neglectably approached at the point P , if a sphere S exists with 
P as center such that the projection upon the XY plane of the points of the 
surface that lie in S constitutes a set of zero measure; / is said to be neglectably 
approached at the point P via N &lt; if a sphere S exists with P as center such that 
the projection upon the XY plane of A T &lt; S has in common with N&lt; a set of 
measure zero. We have the following 

THEOREM VIII. Let z = f(x ,y) be any real, single-valued function defined 
in the entire XY plane. Then the points P of the XY plane that possess a non- 
vanishing partial neighborhood via which f is neglectably approached at P con 
stitute a set of zero measure. 

The proof of this theorem, which is a generalization of Theorem VI, is 
analogous to that of the latter. A few indications will therefore suffice. We 
note first that the set of points where / is neglectably approached is of measure 
zero; the proof is similar to that of the exhaustibility of the set of points at 
which/ is exhaustibly approached.* We then define 7$H ri r 2 P , P = (x , y) , 
as follows : there is a point ( x , y , f ) , r\ = " = r 2 , such that if S is any sphere 
with ( x , y , f ) as center, and the surface points in S are projected upon the A" 1" 
plane, the portion falling in I forms a set of positive exterior measure; i.e., 
( x &gt; y &gt; f ) is n t neglectably approached via /. It will be seen that ^R ri r 2 is 
closed. The rest of the argument is left to the reader. 

Theorem VIII asserts that certain sets are of positive exterior measure. 
We shall go a little further by considering the question of exterior metric 
density. Let P be any point and M pt the set of points Q for which 



We shall say that / is quasi-continuous] at P if for every f the set M Pe has 1 
as exterior metric density at P. We have the following theorem, which 
generalizes Theorem VIII: 

THEOREM IX. / is quasi-continuous except at the points of a set of measure 
zero. 

Proof. Let S ri r t , ri &lt; r 2 , be the set of points (x , y) for which 

ri &lt;/(.v , y) &lt; r t , 

and T Tl r t be the set of points where the exterior metric density of S TlTt is not 1 ; 
TVjrj is therefore of measure zero according to Theorem V. If r\ and r 2 are 
rational, we have in all t$ sets T Tl r 2 , and thus the points in all the T Tl r 2 con 
stitute a set Z of zero measure. If P is a point at which / is not quasi-con 
tinuous, then, according to definition,, there exists a positive number e such that 

* See Section 1. 

t Cf. Denjoy, Bulletin de la Societe Mathcmatique de France, 
vol. 43 (1915), p. 165. 
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M pe is not of exterior metric density 1 at P. Let TI and r 2 be two rational 
numbers such that 



/(P) -*&lt;n &lt;/(P) &lt;r 2 

It follows that P belongs to T Tl r t and therefore to Z . 

Suppose now that N &lt; is a partial neighborhood of P such that if C r is a 
circle with P as center and r as radius then 

,. . ,m(C r N &lt; } 

lim inf - - = k &gt; ; 

r-t-O m(Cr) 

the symbol m denotes Lebesgue measure, which N &lt; , as an /-region, possesses. 
We shall then say that N&lt; is a "properly non-vanishing partial neighborhood" 
of P . Suppose further that a positive e exists such that 

,. . ,m e (C r N &lt; M Pe ) , 

lim inf T7r^f~^ =l&lt;l. 

r-*o m(C T N &lt; } 

We shall then say that the exterior metric density of M pt is ^ 1 at P via N &lt; . 
It follows then that the surface z = f(x , y} is not quasi-continuous at P . 
For if 5 is any positive number, there exists a circle C r with arbitrarily small 
radius and with P as center such that 

m(C T N &lt; ) &gt; (k - S)ro(Cr), 
and 

m e (C T N &lt; M Pe ) &lt; (l + 8)m(C r N &lt; }. 
Therefore 
m e ( C r U P( ) = m e ( C r N &lt; M Pf ) + m e ( C r { C r - N&lt; } M P( ) 

&lt; [(J + )(fc-) + 1 - (k-d)}m(C r ) 
-- [1 - (1 -Z- S)(fc-S)]wi(C r ) 

for sufficiently small 6 ; C r N &lt; means here the portion of C r not in N &lt; . 
Since I &lt; 1 , k &gt; , and 5 and r may be made as small as we please, we con 
clude that the relative exterior measure of M pe is not 1 at P; that is, that / 
is not quasi-continuous at P . The totality of such points P is therefore of 
measure zero. W T e thus have 

THEOREM X. If P is a point not belonging to a certain set of measure zero, 
then for every positive the exterior metric density of M Pe is unity at P via any 
properly non-vanishing partial neighborhood of P . 

Since an angular region at P, i.e., the portion of the AT plane bounded by 
two half lines radiating from P , is a properly non-vanishing partial neighbor 
hood of P , it follows as a corollary of Theorem X that, if we neglect a set of 
measure zero, the exterior metric density of M pe is, for every e &gt; , equal to 
unity at P via every angular region ivith P as vertex. 

Trans. Am. Math. Soc. 9. 
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CONCLUDING REMARKS 

As in the case of the descriptive properties, it may be seen that the metric 
theorems may be extended to many-valued functions. Theorem VIII, for 
example, when thus generalized, would read as follows : Let z = f(x , y) be 
any real, single- or many-valued function defined in the entire XY plane. Then 
ike points ( x , y ) of the X Y plane for which a surface point ( .r , ;/ , / ( .r , y ) ) and 
a non-vanishing partial neighborhood. A T &lt; exist such that (x , y, /(.r, y)) ts 
neglcctably approached via N &lt; constitute a set of zero measure. 

It is evident that the metric properties hold for functions of a single variable 
and, in general, for functions of n variables. Extension to No-space and to 
function space would require a satisfactory definition of measure for such 
spaces;* it is not our purpose to enter upon such questions here. Likewise 
we shall not attempt to define, by means of postulates, a general space for 
which our metric results are to hold. 

It may be remarked that instead of projecting the surface points of 
z = f(x , y) upon the XY plane vrc may project them upon the A -axis and 
thus obtain other properties. For example, let us define the relationship 
^ r i r 2 r 3 r 4 (between /-regions and points of the A -axis) as follows: /9u ir2 r 3 r 4 , 
if the surface points having .r-coordinates in / have a limit point in the rec 
tangle x = , ri = y = r 2 , r 3 ^ z ^ n 9?r ir2 r 3 r 4 is evidently closed. If the 
r s are taken to be rational, the number of possible relations 9? ri r r 3 r 4 is No, 
and we may, by the aid of Lemma II and an argument repeatedly employed 
in this paper, obtain the following result: Let z = /(.r, y) be any single- or 
many-valued function defined in the entire XY plane. Let be a point of the 
X-axis of the following character: a surface point (, r/ , f ) and a partial non- 
vanishing (linear) neighborhood N&lt; of exist such that (, 77, f ) is not a limit 
point of surface points having x-coordinates in N &lt; . The totality of points is 
of measure zero. 

Similar results may be obtained for other metric properties and also in the 
case of the descriptive properties. In the case of a function of n variables, 
we may project upon an (ti -- 1 )-space, an (re 2)-space, etc. 

* In this connection cf. Gateaux, Bulletin de la Societe Mathematique 
de France, vol. 47 (1919), p. 47. 

UNIVERSITY OP ILLINOIS, 
URBANA, ILL. 



A FUNDAMENTAL SYSTEM OF INVARIANTS OF A MODULAR 
GROUP OF TRANSFORMATIONS* 

BY 

JOHN SIDNEY TURNER 

1. Introduction. Let G be any given group of g homogeneous linear trans 
formations on the indeterminates .TI , , x n , with integral coefficients taken 
modulo in. Hurwitzf raised the question of the existence of a finite funda 
mental system of invariants of G in the case where m is a prime p , and obtained 
an affirmative answer when g is prime to p . Dicksonf subsequently obtained 
an affirmative answer for any g . 

The general case presents great difficulty, owing to the fact that resolution 
into irreducible factors with respect to a composite modulus is not, in general, 
unique. The present investigation is confined to the case in which there are 
two indeterminates x , y , and m is the square of a prime p . The given group 
will be denoted by H , the notation G being retained when m = p. It is 
proved that the p 2 + 1 invariants 

L p , Q p , pL" Q e ( a , 13 = , 1 , , p - I ; a , not both zero), 



where 



L = yx* -xyv, Q = (a* 2 - 1 - 



form a fundamental system of (independent) invariants of the group H . 

2. Consider the group H of all linear homogeneous transformations 
modulo p 2 : 

(1) x = ax + by , y = ex + dy , ad be = 1 (mod p 2 ) , 

where a , b , c , d are integers. To each transformation of H corresponds a 
unique transformation of the group G: 

(2) x = a-i x + biy, y = cix + diy, GI d t - bi ci = 1 ( mod p ) , 
where ai , bi , Ci , f/i are integers. In fact, we have only to choose 

i = a, , di = d (modp). 

Conversely, to each transformation (2) corresponds one or more trans 
formations (1). For, we can choose a=ai, , d= di (mod p) so that 

* Presented to the Society, April 15, 1922. 

fArchiv der Mathematik und Physik (3), vol. 5 (1903), p. 25. 

t The Madison Colloquium, Lect. III. 
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ad be = 1 ( mod p 2 ) . For example, if ai ^ ( mod p ) we may take 
a=ai,b=bi,c=ci( mod p ) , and determine d by ad be = 1 ( mod p 2 ) ; 
evidently d = d\ ( mod p ) . 

Hence if we reduce all the transformations of H modulo p , we obtain all the 
transformations of G . 

3. Definition. A rational and integral invariant of H is a polynomial 
7 ( x , y) in x and y with integral coefficients, which remains unchanged 
modulo p 2 under every transformation (1). That is, 

(3) I(x ,y ) = I(ax-\-by,cx + dy)mI(x,y} (modp 2 ) 

for all integers a , , d such that ad be = 1 ( mod p 2 ) . 

Evidently any rational and integral invariant is a sum of homogeneous 
invariants; hence we restrict the investigation to the latter. 

4. THEOREM I. Let I (x, y) be a rational and integral invariant of H , and 
let Ii(x , y) be the polynomial obtained from I (x , y) by replacing each coefficient 
by its positive or zero residue modulo p. Then Ii(x, y) will be a rational and 
integral invariant of G . 

We have (3) for all transformations of H . Now 

I(x,y)=Ii(x,y} (modp) 

and 

I(ax + by,cx + dy)mli(ax + by , ex + dy ) 

= li(aix + bi y, c l x + di y} (mod p) , 
hence 

(4) Ii(x , y } = /iOi.r + biy, dx + d t y) = Ii(x, y} (mod p) , 

and by 2 this is true for all transformations of G . 

5. Now (Madison Colloquium, pp. 34-38), 

h(x,y}=kT^T?- ... Zf... 77-* (modp), 

where k is an integer, 



Ri being a polynomial in its two arguments, with integral coefficients ; moreover 
the Ti (i = 1,2, , r} contain no multiple factors, and are relatively 
prime modulo p . Hence 



(5) I(x,y)mlcT?-T - T?- - T? 

where F ( x , y} denotes a polynomial in x , y with integral coefficients. 

* In the discussion which follows, if any a; is zero the corresponding Ti is to be suppressed . 
t If p = 2 , we omit the divisor 2 in the exponents. 



1922] INVARIANTS OF A MODULAR GROUP OF TRANSFORMATIONS 131 

6. Discussion of equation (5). Apply to I (x , y) the transformation 
(6) x = x + py , y = y ( mod p 2 ) , 

expand by Taylor s Theorem, and denote the partial derivative of Ti with 
respect to x by 7" . Then 



,, 

(t ) 

X Tr~ l Z ,- T-, r,_i T\ T i+l T r (mod p 2 ) . 



Since (6) is a transformation of H , 

I(x + py, y) = I(x, y} (modp 2 ). 

Hence either k = ( mod p ) , in which case the right member of (5) reduces to 
its second term, or 

(8) Z a,- T, T^, T i T i+l T r = (mod p) . 
i=i 

Let (gi, 1 ) be a point at which Ti (x , y) vanishes. Then, for,; ^ i , Tj (x , y) 
cannot vanish at ( g, , 1 ) ; for, in that event, Tj(x, y} would be a factor of 
Ti(x, y) modulo p ,* contrary to 5. Therefore from (8) we have 

(9) aiT ^gi, 1) = (modp). 

Hence either a,- == 0, or 7 1 (&lt;?;, 1)^0 (mod p) . In the latter case, by a 
known theorem on Galois imaginaries, Ti ( .r , 1 ) and T { ( x , 1 ) have a common 
factor with integral coefficients modulo p . But ( 5) Ti(x, 1 ) contains no 
multiple factor modulo p . Therefore! T\(x , 1 ) = ( mod p ) , whence 

(10) T i(x,y)=Q (modp). 

Hence we have 

THEOREM II. In equation (5), for each i = 1 , , r , either on is a multiple 
of p, or T i(x, y) = Q (modp). 

COROLLARY 1. a.\ = (mod p) . 

For Ti = yx p - xy p ; hence T( == pyx*** 1 - y p H= (mod p ) . 

COROLLARY 2. o? 2 = ( mod p ) . 

For 



hence 7^ s|i ( mod p ) . 

COROLLARY 3. // on = pfiifor i &gt; 2, 

(11) T a i mS i (D ,Q^ (modp), 

S is a polynomial in its arguments, with integral coefficients. 

* The Madison Colloquium, p. 38. 

f Dickson, Lecture Notes on Double Modulus and Galois Imaginaries, 5. 
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For if we expand 



we observe that in each term the exponent of L is a multiple of p and that 
either the exponent of Q or the coefficient of the term is a multiple of p . 
7. Discussion of T \ ( x , y ) = ( mod p ) . Write 



T=0 



(12) Ti(x,y) - 

where I = L* p(p ~ l) ,* q = Q* (p+1) , and the coefficients A T are integers; then 
T i( x , y ) = E J r ( n -- r ) Z""^ 1 / f/ + J, r/- r &lt;T 9 = ( mod p ) , 



where V , q denote the partial derivatives of I, q with respect to x . Evidently 
I = , q =| ( mod p ) ; therefore 

(13) rA T l n ~ T q*- 1 = ( mod p } . 

Each term of (13) is the product of the preceding by cq/l , c a constant; the 
degree in x of q/lis \ (p 2 - p ) ,| hence the degrees in x of the successive terms 
increase by f (p 2 - p) . Equating coefficients of x, we find in succession 

nA n ==Q, , ?vl r =0, , AI = (rnodp). 

Hence in each term of " =1 A T l n ~ T q r , either the coefficient A r or the exponent 
of q is a multiple of p , and we have 
THEOREM III. // T ( .r , y ) m ( mod p ) , then 

(14) T 1 (x,y)=S l (L p ,Q p ) (modp), 

ichere Si denotes a rational and integral function of its arguments, with integral 
coefficients. 

COROLLARY. [ T l ( x , y ) ] a &gt; is a polynomial in L p ,Q P , with integral coefficients, 
modulo p . 

8. THEOREM IV. L p is invariant under the group H . 

Write L(x , y ) = e, L(x, y} = f , where x , y are derived from x, y by 
any transformation (2) of the group G; thenj e / = ( mod p ) . Hence 
e /= (mod p) for every transformation (1) of H . For if, as in 2, we 
choose ai = a, , di = d modulo p, we have 

L(ax + by, ex + dy) = L(ai x + hi y, d x + d l y) ^L(x, y) (mod p) . 

* If p = 2 , we omit the divisor 2 in the exponents. 

t If p = 2 , the degree is 2. 

J The Madison Colloquium, p. 35. 



1922] INVARIANTS OF A MODULAR GROUP OF TRANSFORMATIONS 133 

Also 

e p -f p = (e -f+D p -f p 



and each factor on the right is identically congruent to zero modulo p; hence; 
e p - f p = ( mod p 2 ) ; that is 

[L(ax + by, ex + dy)] p = (L(x,y)} p (mod p 2 ) 

for every transformation of II . 

COROLLARY 1. In the same way, it can be proved that Q p is invariant under 
the group H . 

COROLLARY 2. pL" Q* 3 is invariant under the group II . 

9. THEOREM V. Any rational and integral invariant of the group H is cc 
rational and integral function, with integral coefficients, of the p 2 + 1 invariants 
L p , Q" , pL a Q (a, P = 0, 1, , p -- 1; a, P not both zero). Conversely,, 
any such function is an invariant of H . 

In (5), the term kT" 1 T%* T" r is an invariant of H . For if any a z = 
( mod p ) , then by Theorems II and IV with their corollaries, T" is an in 
variant; Ti == L , T z = Q , on == (xi = ( mod p } . While if a } - ^ ( mod p ) , 
T J = 0, and by Theorems III and IV with their corollaries T"&gt; is an invariant^ 

Hence the second term pF (x , y) of (5) is an invariant of H , and it follows 
from 2 that F ( x , y) is an invariant of G . Therefore pF (x , y) is the 
product of p by a polynomial in L and Q . It follows that if I ( x , y) is any 
rational and integral invariant of H , 

(15) / (x , y) = S ( L p , Q p , pL a Q^) (mod p 2 ) t 

where pL" Q denotes the set pL , pQ , pL 2 , pLQ , , pLp~ l QP~ I , and S 
denotes a rational and integral function of its arguments, with integral 
coefficients. 

Conversely, any rational and integral function of L p , Q p , pL a Q" , with 
integral coefficients, is a sum of invariants, and is therefore itself an invariant. 
Hence these p 2 + 1 invariants form a fundamental system. 

10. THEOREM VI. The invariants of the fundamental system are independent. 

In view of the coefficients p , neither L p nor Q p can be expressed as a poly 
nomial in the remaining invariants, with integral coefficients. Assume that 
pL" 1 Q 01 , i = p 1 , /?i = p 1 , can be so expressed. Then 

(16) pL"i Q^ = P ( LP , Q p , pL a Q" ) ( mod p 2 ) 

identically in x , y . We may suppose that P contains no group of terms which 
vanishes identically modulo p 2 . Let mL" 2 Q^ 2 be any term of P; then pL" 1 - Q^ 1 
and mL" 2 Q 02 must be of the same total degree in x , y , and also of the same 
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degree in x alone. Therefore 

oi(p + l)+jSip(p--l) =o4(p + l)+&p(p--l), 

&lt;Xi p + & p (p - 1 ) = 2 P + ft P (P - 1 ) , 

whence ai = 2 , /3i = /3 2 Hence P consists of the single term pL? 1 Q ffl . 
Evidently pL" 1 Q 01 is not a product of fundamental invariants, hence the 
theorem is proved. 

11. If we consider the total group 

x s=ax + by, y = ex + dy ( mod p 2 ) , 

ad be ^p ( mod p ) , 

we find, exactly as in Theorem IV, that Q p is an absolute invariant, and that 
If , pL a QP are relative invariants of indices p , a , respectively. 

IOWA STATE COLLEGE, 
AMES, IOWA. 



THE GAUSSIAN LAW OF ERROR FOR ANY 
NUMBER OF VARIABLES* 

BY 

J. L. COOLIDGE 

The exponential law for the distribution of accidental errors of observation, 
discovered by Gauss, has been a mathematical classic for over a century. 
Many have been the attempts to prove it, all based, necessarily, on more or 
less arbitrary assumptions. Perhaps the most searching examination of it was 
given by Poincare in his Calcul des probabilities; his final opinion seems to be 
contained in the following phrase: f 

" J ai plaide de mon mieux jusqu ici en faveur de la loi de Gauss dont nous 
allons maintenant tirer les consequences. Peut-etre pourtant la cause n etait- 
elle pas parfaitement bonne. 

"Elle ne s obtient pas par des deductions rigoureuses, plus d une demon 
stration qu on a voulu en donner est grossiere, entre autres celle qui s appuie 
sur l affirmation que la probabilite des ecarts est proportionnelle aux ecarts. 
Tout le monde y croit cependant, me disait un jour M. Lippmann, car les 
experimenteurs s imaginent que c est un theoreme de mathematiques, et les 
mathematiciens, que c est un fait experimental." 

The law has been extended to include the distribution of errors depending 
upon two variables, and in this form it has a certain importance in the theory 
of ballistics, and in that of statistical correlation; even the case of three vari 
ables has been slightly treated. The general case of n variables has never 
been taken up except in two recent articles by von Mises.f The treatment 
here is based on a very general form of analysis showing how an arbitrary 
distribution function will lead asymptotically to an exponential form. The 
analysis is very careful, the point of view extremely abstract, with little 
relation to practical applications. Moreover, the author gives no indication 
how the constants should be calculated in any particular case. It is the 
object of the present paper to deduce the Gaussian law for n variables by a 
method based upon the classical one for a single variable, but with somewhat 
broader and more explicit assumptions. In the second part we shall make 

* Presented to the Society, December 27, 1922. 

t Poincare, Calcul des Probabilites, Paris, 1896, pp. 196 and 149. 

{ Fundamenlalsatze der Wahrscheinlichkeilsrechnung, Mathematische Zeit- 
schrift, vol. 4 (1919), and Grundlagen der Wahrscheinlichkeitsrechnung, ibid., vol. 5 
(1920). See also Dodd, Functions of measurements, Skandinavisk Aktuarie- 
tidskrift, 1922. 
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the additional assumptions necessary to determine the coefficients in any 
particular case, and show how these latter may then be calculated in a simple 
manner. 

1. THE DEDUCTION OF THE LAW 

Suppose that we are concerned with measuring groups of m quantities. 
We shall, for simplicity, assume that all groups are equally trustworthy, 
although the extension to the case of differently weighted groups is not difficult. 
We shall make certain assumptions about the distribution of errors, meaning, 
thereby, accidental errors, for we assume that constant errors have been 
removed. 

ASSUMPTION 1. The a priori probability that a group of quantifies to be 
measured should take values in the infinitesimal region 



where the points X , Y , Z , He in a certain continuous m-dimensional manifold 
S , will differ by an infinitesimal of higher order from the expression 

f(X, Y,Z, --^dXdYdZ--- , 

where the function f is continuous with continuous first derivatives in S . 

ASSUMPTION 2. The probability that a group of quantities whose true values 
are X , Y , Z , in S should be observed to have values, after the removal of 
constant errors, which lie in the infinitesimal region 

x \dx, y %dy , z\dz, 

where (x, y, z, ) is a point of S, will differ by an infinitesimal of higher 
order from 

$(X , Y, Z, ,x,y,z, )dxdydz 

where the function $ is continuous with continuous first and second partial 
derivatives, and has a value independent of the choice of origin. 

The last part of the assumption is plausible in practice, because if we are, 
for instance, measuring a length on a scale, the accidental errors will arise 
from various physical causes independent of the position of the . Moreover, 
it has a momentous consequence, for 

3&gt;(X, Y, Z, , x,y,z, ) 

= * 0,0,0, ,*- X,y - Y,z- Z, ...|. 
Waiting in the explicit values of the errors we have 



that is to say, the probability for a system of errors is a function of those 
errors, and not of the true values and observed values considered as independent 
variables, a point which has been a stumbling block to some writers. 
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ASSUMPTION 3. The mean value for the error on an individual variable is . 

This again is plausible, for a contrary assumption would show a tendency to 
favor positive or negative errors, and such a tendency we should naturally 
class with the constant errors, not with the accidental ones. As a further 
matter of notation let us write the averages 

_ _ Xi + x 2 + + .r n _ .vi + y-i + + y n 

x , y - &gt; 

n n 

(1) 

21 ~r 22 ~l i z n 

z = - , . 
n 

The actual errors are 



(2) & = Xi - X , 

The residual errors are 

5 t - = Xi x, a = Hi y 

Xi x = Xi - X - - ( x - X ) , 

(3) ._,._&lt;&. 



n 

The individual groups of observations are independent of one another; 
hence, by Assumption 3, 

Mean value , % } = , 

n 1 
Mean value 5? = - - Mean value ? , 

n 

? 

(4) Mean value ? = Mean value 

Mean value t - t]i = Mean value 



711 
y f 

=^- 

n 1 



ASSUMPTION 4. If the infinitesimal increments dx , dy , dz , be sufficiently 
small, the probability that the true values lie in the region 

x \dx , y %dy , z \dz , 

is greater than that they lie in any other region of like structure about any other 
point. 

We have now made a sufficient number of assumptions to enable us to 
deduce the analytic form for our functions. We do this, following the original 
method of Gauss, by calculating the probability that a given set of observa 
tions should have resulted from observing a group of quantities of assumed 
true value. The probability that the measurements x\, y\,z\, , Xi , y-&gt; , z 2 , 
, x n , y n ,z n , were made on quantities whose true values are X , Y , Z , 
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is by Bayes theorem 

f(X,Y,Z, ...)*!* 3&gt; n dXdYdZ 



* = *(;, fii, ft, ) 

The integration in the denominator is supposed to be extended throughout 
the whole region S . This expression will be a maximum with the logarithm 
of its numerator. Equating to the partial derivatives to X , Y , Z , , 
we get 

_ d log / d log $1 d log $ 2 , , d log $ _ 

dX ~db~ ~%~. *. 

_ 6 log/ ^ log $1 , 6 log $2 , . , d log $n = Q 

dY drji 3r?2 dr; n 

One set of solutions will arise in case all of the observations have been 
correct, i.e., 



Now, by definition, / is independent of n , hence 

&lt;*hzf-&lt;*^$l- -0 f- ist 

az SY 

Let us underline the fact that we are considering probabilities for observa 
tions which do not go outside of the region S . We could not have/ a constant 
throughout all space without a contradiction, and it is also evident that / 
must be rigorously throughout most of space. The partial differential 
equations now take the simpler form 

d log $1 , d log $ 2 , , d log 3&gt; n _ 

dfi a& a& 

d log $1 , d log $ 2 , . _ , d log $ = Q 



These equations hold whenever 

X = x, F = ^, Z = z, 

i + &+ +^ = 0, 

?1 + ?2 + + 7?n = 0, 
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By Assumption 2 we are free to treat our assumed groups as if they were 
absolutely independent quantities, provided we do not go outside of S . Let 
us, then, assume that the observed groups x\ , r 2 , , x n ; y\ , y z , , y n , 
Zi,zz, ,z n take such infinitesimal increments that the averages x , y , z , 
in (1) are not altered: 



log $1 1 s,- , 9 [ a log $ 2 1 a rajog$ n ~| 

.... Osi + -^r ... ?2 + + rr- n = 0, 

oi a$2 L "s2 &lt;?sn, L ^si 

a r a log $i "I a r a log $ 2 "I ^ . , a r d iog_4&gt; i =Q 



a r a 


log&lt;3 


.1 


i a r 


a log &lt;3 


h l^ 


in 1 


I 


a ra 


log ^1 


1 /5r 


. 


arji (_ 


&lt;5r?i 


0771 


drj 2 L 


d-m 


J 


TJ J 1 


1 


drinl 


^r?,, 


J 5 


fn - 



We have also 

Si + afc + +5| n = 0, 

5r?i + 5rj 2 + + 5rj n = 0, 

Each of the first set of equations in the variables di , d 2 , , rf$n must 
hold whenever the last equation in these variables holds, hence, integrating 
once, and dropping subscripts, 



Giving the successive values 1 , 2 , &gt;& an d doing the same for the other 
variables, and then summing, we have 

w = 0, 

a log $ 

= 



Here the expression \f? is a homogeneous quadratic form in the variables. 

Our assumptions are sufficient to enable us to make a very definite statement 
about the function \f/* , namely, that its discriminant is not zero. For if the 
discriminant were zero, the partial derivatives would be linearly dependent, 
and vanish for an infinite number of sets of values for the variables, and this 
is directly in conflict with our fourth assumption that the only maximum 
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arose from taking all of these variables equal to zero. Furthermore, since 
this is a maximum, we know that the form is definite, i.e., 

The homogeneous quadratic form \p is positive and definite, with a non-vanishing 
discriminant. 

2. DETERMINATION or THE CONSTANTS* 

It should be emphasized that everything which we have done so far is under 
the assumption that we are dealing with observations in the region S . We 
have found the probability that an observation in the region S should lie in 
a certain infinitesimal sub-region. In practice this is of no interest whatever 
until we have some idea of what the region S may be. It certainly could not 
be the whole of space, as the assumption that / is everywhere constant leads 
to a contradiction. On further consideration we notice two things. First of 
all, it seems quite plausible that / might be constant throughout a certain 
region, and equal to almost everywhere else. Second, the expression (6) 
is excessively small, except in a very strictly confined space, and this rapid 
diminution of (6) would produce a result close to that of the vanishing of/. 
In other words, the error in calculating the constants will be very small if we 
assume that the formula (6) is universally valid. On the strength of this 
we make 

ASSUMPTION 5. For the purpose of calculating constants, formula (6) may be 
assumed true throughout all space. 

We note, secondly, that the only method for calculating our constants is 
to assume that certain observed values may be identified with their mean 
values as calculated by formula. The right sides of the last two equations 
(4) are proportional to the mean values of the averages of certain observed 
quantities, and we know by Tchebycheff s theoremf that it is highly likely 
that the value of an average shall be close to its mean value. This leads to 

ASSUMPTION 6. When the number of groups is large, the mean values of 
2 ; V 2 &gt; &gt; V &gt; may be equated to the observed values 



n I n 1 n 1 
These quantities give, of course, the probable errors of individual measure- 



* The mathematical manipulation that follows depends on obvious applications of the 
theory of determinants. The methods and final formula are very close to Greiner, Z e i t - 
schriftfurMathematikundPhysik, vol. 57, pp. 226 ff ., and Pearson, Philo 
sophical Transactions of the Royal Society, vol. 187, pp. 299 ff. Pearson 
assigns the credit to Edgeworth, Philosophical Magazine, ser. 5, vol. 34, p. 201. 
I must confess to finding Edgeworth so obscure that I do not know whether his result is like 
mine or not. Moreover, none of these writers seem to me to set forth the underlying assump 
tions with desirable clearness. 

t Tchebycheff, Oeuvres, Petrograd, 1899, vol. 1, p. 687. 
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ments and their correlation coefficients two by two. It is interesting that 
these should be the only independent constants. 

In order to clarify the manipulation, we shall at this point take the perilous 
step of changing our notation. Logically, the notation we are now going to 
adopt might well have been used from the start, but the resulting summation 
formulas, by their very compactness, are rather obscure, and it is easier to 
see what is really going on, by using the more diffuse symbolism with many 
continuation signs which we have employed so far. For the actual errors 
committed, we shall write 

(7) = a-i , 7] = x 2 , r = .TS , , 
there being m in all. The n sets of in residuals shall be written 

Sll j 5j2 , , 5l m ; 21 , 22 , , Sim! j 8 n i , 5 n o i ) Onm 

Our fundamental formula (6) may now be written 

(8) 3&gt;=Re-- a w, d ij = a ii . 

The assumptions 5 and 6 may be expressed by the equation 



dx m . 



(9) Pi] - = ^* dki f kj = R C X ... r Xi Xj e-- 

n I J_ x J_ atl 

Since the discriminant of our quadratic form is not zero, we may find a 
linear transformation 

(10) ^ =- I&gt; fc .-r{, \ Cij *0&gt; 
such that 



i, i 



(11) ^CikCjidij = 0, k 4= I, 



/ i Cir Cjr dij O r . 
1,3 



Since the discriminant is an invariant of weight 2 , 

bib z b m = \Cij\ 2 - \dij\. 
The inverse of the substitution, contragredient to (10), is 



k = X) ik Cjk Wi Wj . 



, 

In the projective space of n -- 1 dimensions where a point has the homogeneous 
coordinates Xi , .r 2 , , x n the hyperquadric 
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has the tangential equation 



[September 



t, 3 



d | an I 
dan 



In terms of the new variables we have 







(12) 



/2 

- 

i &gt; 



i.) 



^~* Cir Cj 

b r 



We may express (9) in terms of the new variables. The only point to re 
member is that the jacobian of the transformation is c,-&gt; | ; 

Xoo /oo _26 y .z r 2 

I S c ffc cyz x x , e dx( dx z dx m . 
x J-X k , l 

This simplifies greatly because 

[*Cf] 

I 4 e-w"~ dx[. = . 

J 00 



Hence 



/00 / 

n | I I 

J03 / 



2&r^ 

c,/t CJ-A- .vf e f/.i i f/a-2 dx ,,, . 



03 / 00 



We have the well known integrals 

/x /*oo 

/ ft J """ / 2 / 2 J "^ 

e~ *** o.Xk = := ; f x k e~ *** a; c = ,^3^ 

J-oo Vfeft J-oo -i- 



Hence 



or, by (12), 



Pa = 






B|c&lt;,-| r 



m 2 



2 |o 



4 = r 



Furthermore, since the probability of some group of errors is 1 , 

/*oo /co 

1 = R I I e -*HW dan dx n , 

*J no *J x 



&.~V|e| 
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Dividing out R , we have 

YI . . 

t l} ~ o I I 
z I &ij I 

In these equations the quantities pij are known; we wish to find the quantities 
aij . We first introduce one more symbol: 

P,. = k 



Since the process of interchanging each element of a non-vanishing deter 
minant with its cofactor is an involutory one, except for multiplication by a 
power of the determinant, we must have 



We calculate M by a little jugglery: 



\rn-l 



M- 



M = 



Mol _ 

2m 1 f, m On 
1*13 ^ 



2 1 Pol 



Let us exhibit our results in the form of a table: 
Assumed errors : x\ , x 2 , , x n . 
Gaussian law of error for n variables : &lt; = Re~^ H XiX . 
Observed residuals: &lt;5n , 5 12 , , 8i m ; S 2 i , 5 22 , , 8 2m ; 



- J 5 ni , 



Pa = 



a-- 

I ~ 



- 
t n 



~"i 
- 1 



R = 
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CERTAIN THEOREMS RELATING TO PLANE CONNECTED 

POINT SETS* 

BY 

ANNA M. MULLIKIN 

I. INTRODUCTION 

A point set M is saidf to be connected if it cannot be expressed as the sum 
of two mutually exclusive point sets neither of which contains a limit point 
of the other. SierpinskiJ has shown that a closed, bounded, connected set 
of points in space of n dimensions cannot be separated into a countable infinity 
of closed point sets such that no two of them have a point in common. It 
will be shown in the present paper that for the case where n == 2 , this theorem 
does not remain true if the stipulation that M is closed be removed. It will 
however be shown that a plane point set, regardless of whether it be closed or 
bounded, which separates its plane cannot be expressed as the sum of a 
countable infinity of closed, mutually exclusive point sets, no one of which 
separates the plane. Of the other results established, the principal one is 
that if M i and M z are two closed, connected, bounded point sets, neither 
of which disconnects a plane S , a necessary and sufficient condition that their 
sum, M , shall disconnect S is that M , the set of points common to Ifi and 
M 2 , be not connected. 

I wish to thank Professor Robert L. Moore, who suggested the theorems 
of this paper. Without his help and encouragement it could not have been 
written. 

II 

The following is an example of a countable collection of mutually exclusive, 
closed, and bounded point sets with connected sum. Consider a countable 
infinity of arcs each of which is made up of four straight-line intervals (Fig. 1), 
the nth arc being drawn from the point ( m/2 n ~ l , ) to ( m/2 n ~ l , m/2 n ~ l ) , 
thence to ( - m/2"- 1 , m/2"- 1 ) , thence to ( - m/2 n ~ l , - m/2 n ~ l ) and thence 
to (m, -- TO/2"" 1 ) . Let n go from one to infinity, and let M be the point 

* Various parts of this paper were presented to the Society on October 25, 1919, December 
28, 1920, and February 26, 1921. 

t See N. J. Lennes, Curves in non-metrical analysis situs with an application in the calculi^ 
of variations, American Journal of Mathematics, vol. 33 (1911), and 
Bulletin of the American Mathematical Society, vol. 12 (1906). 

J W. Sierpinski, Un theoreme sur les continus, Tohoku Mathematical Journal, 
vol. 13. 
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set composed of the sum of all the arcs so obtained. It will be seen that each 
of these arcs contains a limit point of every subset of M which consists of an 
infinite number of the remaining arcs. Hence the set M is connected. It is 
obviously bounded. 



x 



Y&gt; 
FIG. !. 

THEOREM 1.* If, in a plane S , K and M are two closed, mutually exclusive 
point sets and H is a closed, bounded, connected point set having at least one 
point in common with each of the sets K and M , then there exists a point set H , 
a subset of H , such that H is connected and contains no point of either K or M , 
but such that K and M each contain a limit point of H . 

In our proof of Theorem 1, we shall make use of the following two well 
known theorems, A and B . 

THEOREM A.\ If K and M are two closed point sets having no point in 
common, and H is a continuous, bounded point set having at least one point in 

* Rosenthal gives a proof for the case in which each of the sets K and M reduces to a single 
point. See A. Rosenthal, Teilung der Ebene durch irreduzible Kontinua, Sitzungs- 
berichte der mathematisch-physikalischen Klasse der Bayer- 
ischen Akademie der Wissenschaften z u Munchen, 1919, p. 104. 

t Janiszewski gives a proof for the case in which each of the sets K and M reduces to a 
single point. His proof can readily be extended to the more general case. Cf. S. Janiszewski, 
Sur les continus irreductibles enlre deux points, Journal de L Ecole Polytech- 
nique (2), vol. 16 (1912), p. 109, Theorem 1. 
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common with each of the sets K and M , then H contains a subset which is irre 
ducibly continuous from K to M . 

THEOREM B* If HI, H 2 , H 3 , is a countable collection of connected 
point sets, and P is a point such that every circle containing P contains a point 
from all except a finite number of tliese sets, then the limit setf of the sequence 
HI , H&lt;i , HZ , is connected. 

Proof of Theorem 1. By Theorem A, H contains asubsetwhich is irreducibly 
continuous from K to M . Call this set H KM . Let K and M denote the points 
of H KM belonging to A" and M respectively. Let // denote H KltI K -- J/. 
We can show that H is the required H . Evidently it only remains to be proved 
that H is connected. Consider any point P of H . We can show that the 
largest connected subset of H in which P lies has a limit point in either A* 
or M . For suppose it has not. It will then be closed and may be enclosed 
in a finite number of circles no one of which contains or encloses a point of 
either K or M . The interiors of these circles form a domain D P . Now select 
some point AI of A . Since P and K\ lie together in the connected set H KM 
they can be joined by a broken line composed of a finite number of intervals 
of length less than half an inch, such that the vertices of this broken line 
belong to H KM . Let LI be that vertex on this broken line which immediately 
precedes the first vertex on it, in the order from P to AI , that lies without D P . 
Then join P and AI by a broken line of intervals of length less than a quarter 
of an inch such that the vertices belong to H KAt . Let L- 2 be the point on this 
line corresponding to LI . Continue this process indefinitely. By Theorem B 
the limit set will be connected. It will contain P and a point on the boundary 
of D P , I namely the limit point of LI, Z- 2 , L 3 , . It contains only points of 
HKM_&gt; but_since it lies wholly within D P plus its boundary, it contains no point 
of A or J/ and hence is a subset of H . This is contrary to the hypothesis 
that the largest connected subset of H in which P lies is within the domain D P . 

Denote by H K the set of those points lying in a connected subset of H of 
which A contains a limit point, by H M the set of those which lie in a connected 
subset of H of which M contains a limit point. Let H K + A be denoted by 
SK &gt; HM + M by S M . Since S K + S M = H KM , and since H KM is connected, 
S K and S M must have a point in common or else one of these sets must contain 
a limit point of the other. Suppose first that they have a point in common. 
This point must belong to H , and it is evident that since H KM is irreducibly 

* See S. Janiszewski, loc. cit., p. 98, Theorem 1. 

| By the limit set of a sequence of sets Hi, H 2 , H 3 , we mean the set of all points 
[P] such that P is a limit point of a set of points X lt Z 2 , X 3 , such that for every k, X k 
belongs to Ht. 

t Janiszewski gives a parallel argument to prove that if the continuous set C contains a 
point A which is an interior point of the closed set K , then there exists a continuous set con 
taining the point A and contained in K and C. See S. Janiszewski, loc. cit., p. 100, Theorem IV. 
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continuous, H must in this case be connected. Suppose secondly that one 
of the sets contains a limit point of the other, for instance that S K contains a 
limit point, P K , of S M . And suppose that H is not connected. Since S K 
contains a limit point of S M it is evident that H M must exist. Then if H M = H , 
H M cannot be connected. But suppose that the set H K actually exists; we 
can show that in this case, too, H M is not connected. For suppose it were. 
P K + H M together with the largest connected subset of H in which P K lies 
and all points in H K which are limit points of H M would then be a connected 
subset of H and consequently a proper subset of H . This together with its 
limit points in K and M would be both a continuous set between K and M 
and a proper subset of H K3t . This is contrary to the hypothesis that H K3I is 
irreducibly continuous between K and M . We have therefore shown that if 
H is not connected, H M is not connected. 

Suppose this to be the case, and let H M == H Ml + H Mi where H Ml and # 1/2 
are two mutually exclusive sets neither of which contains a limit point of the 
other. Suppose P K is a limit point of H Ml . Then enclose every point of H 3I , 
in a circle which encloses no point of the set H Ml + P K + M . The interiors 
of these circles form a domain D . Now since P K is a limit point of H Ml and 
every point of H m is connected with some point of M in a subset of H Ml + M , 
P K can be joined by an infinite number of broken lines, as before, to points of 
M such that the vertices of these broken lines belong to H^ + M , and there 
fore lie without D or on its boundary. The limit set will then be connected 
and will contain no point of D . This together with the largest connected 
subset of H in which P K lies and the limit points of this set in K will be a 
continuous set from K to M , a proper subset of H KM since it contains no point 
of H Al2 . This is contrary to the hypothesis that H KM is irreducibly continuous 
from K to M . We have therefore proved that H is connected and is the 
required H . 

THEOREM 2. If, in a plane S , H is a closed, bounded point set containing 
two mutually exclusive, closed point sets K and M , but containing no closed, 
connected subset containing a point of K and a point of M , then it is tJie sum of 
two mutually exclusive, closed sets, of which one contains K and the other 
contains M . 

Proof. There exists a positive number e such that no point of K can be 
joined to a point of M by a broken line made up of intervals of length less than 
e such that the end points of these intervals are points of H . For otherwise 
there would be a closed, connected " limit set" as in Theorem 1. This limit 
set would belong to H , since H is closed, and it would contain a point of K 
and a point of M , since K and M are both closed. This is contrary to the 
hypothesis. 

Now let HI denote the point set composed of K together with the set of all 
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points [ P ] of H such that P can be connected with some point of A by a 
broken line of intervals of length less than e such that the end points of these 
intervals belong to H . Let H 2 denote the point set composed of all other 
points of H . H 2 will contain M and it can easily be seen that neither HI 
nor H 2 contains a limit point of the other, since every point of H 2 is at a 
distance greater than or equal to 6 from every point of HI . 

LEMMA. // M is a dosed set not disconnecting* a plane S then any two 
points of S M can be joined by a simple continuous arc lying in S -- M . 

Proof. Let P denote any point of S -- M . Let Si denote the point set 
composed of P together with all points that can be joined to P by a simple 
continuous arc lying in S M . Let Sz denote the set S -- M --Si, and 
suppose that S 2 contains at least one point. Now since M does not dis 
connect S , Si contains a limit point of S 2 , or S 2 of Si . Suppose that Si 
contains a limit point PI of Sz Then enclose PI within a square A* which 
neither contains not encloses a point of M . This square will enclose a point 
P 2 of S 2 . Then PI and P 2 can be joined by a straight line interval lying 
within K and therefore containing no point of M . Since PI can be joined to 
P by a simple continuous arc lying in S M it is obvious that P 2 can also. 
The argument would be similar in the case where S 2 contains a limit point 
of Si . Since either leads to a contradiction we have proved that S 2 does not 
contain even one point. 

THEOREM 3. // M is the sum of a countable number of closed, mutually 
exclusive point sets MI , M 2 , M s , , no one of which disconnects a plane S , 
then M does not disconnect S . 

Proof. Suppose on the contrary that S -- J/ == Si + S z , where Si and 
Sz are mutually exclusive and neither contains a limit point of the other. 
Let MI denote a point_set composed of those points of M that are limit points 
of Si but not of S 2 , Mz the point set composed of those that are limit points 
of S 2 , but not of Si , and let M denote the point set composed of those points 
of M that are limit points of neither Si nor S 2 . There exists in S a countable 
collection K Pl of squares AI , A 2 , K 3 , obtained jn the following manner. 
Take any point PI of Si as the center of a square AI of side 2 inches. Let 
AI, A 2 , K 3 , A 4 be the four squares ^f side one inch each contained in A^ and 
taken in any order. In general let K n be a square of side 2" inches which has 
PI for its center and has its sides parallel to those of AI , and let it be divided 
into 2 4ra ~ 2 squares, each of side l/2 n ~ 1 inches, and let these 2 4 "~ 2 squares follow 
each other in any order, and let them be the squares A 2in _ 2+n ,- -, A 2 ^ +2 _ 4 
in the set K Pl . 1S 15 

Now consider the first square K( of A Pl which contains PI and satisfies 



* M is said to disconnect S if S M is the sum of two mutually exclusive point sets neither 
of which contains a limit point of the other. 
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condition (1) that it contain and enclose only points of Si + MI + M . 
There evidently exists one such square, since PI is not a limit point of 82 
Add to A ( the first square K 2 of A Pl that encloses no point of the interior of K( 
and that satisfies condition (1) and also condition (2) that it shall contain 
or enclose at least one point of Si, and condition (3) that it shall have an 
interval in common with K( . It is evident that there exists a simple closed 
curve C 2 which is a subset of K[ + K 2 and such that the interiors of K( and K 2 
are subsets of the interior of C 2 . In general obtain C 3 , C 4 , C 5 , in the 
following manner: C r shall be obtained by adding to (7 r _i the first square K r 
of K Pl which encloses no point of the interior of C r -i and which satisfies 
conditions (1) and (2) and contains an interval in common with C r -\ . Then 
C T is a simple closed curve which is a subset of CV-i + A and whose interior 
contains the interiors of C r -\ and K r . It can easily be shown that provided 
K r exists and C r -i can be obtained in this manner, then C r can also. For let 
Ai A 2 be an interval common to C r -\ and A . Let AI BA 2 (Fig. 2) be an arc 




FIG. 2. 

lying except for its end points within A and let AI CA 2 be an arc lying except 
for its end points within C r -\ . Then in Theorem 41 of The foundations of 
plane analysis situs* let J\ be C r -i A\ A 2 + AI BA 2 and let J 2 be 
K r Ai AI + Ai CA 2 . Then J will be the required C r . Denote by C the 
set of curves Ci ( K( ) C 2 , C 3 , 

The sequence C will evidently be infinite unless some Ci contains only 
points of M . Suppose this to be the case, and suppose first that there is 
some point P E of Si or $2 without Ci . The curve Ci divides S into two parts, 
its interior I , and its exterior E , such that neither of these parts contains a 
limit point of the other. Now all of Ci must belong to the same set in M , 

* See R. L. Moore, On the foundations of plane analysis situs, these Transactions i 
vol. 17 (1916), p. 155. 
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say MJ, since there does not exist a countable collection of closed, mutually 
exclusive point sets, consisting of more than one set, whose sum is closed, 
bounded, and connected. f Then MJ disconnects S; one part of S - MJ is 
composed of those points of I which do not belong to MJ ( PI is one such 
point) ; the other part is composed of those points of E which do not belong 
to MJ (by hypothesis there is at least one such point). But this is contrary 
to the hypothesis of the theorem. Suppose, secondly, that all points of d 
and its exterior belong to MJ . Start from PI again to add squares of K Pl , 
this time adding the extra condition that no square added shall contain or 
enclose a point of C or its exterior or of If,-; this is possible since MJ and the 
set composed of d together with its exterior are closed sets. Suppose again 
that, some C k of this new set of curves is a subset of M , and therefore of, say, 
MI . If there is some point of Si or S 2 without Ck , then, as above, MI dis 
connects S , which is contrary to hypothesis. But there must be a point of Si 
or S 2 in the closed, connected, bounded point set made up of d + C k and all 
points between them, as otherwise this set of points would be the sum of a 
countable collection of closed, mutually exclusive point sets, subsets of M , 
and would contain subsets of at least two of the sets M\ , M&lt;&gt; , M 3 , , namely 
of MJ and of MI. Thus in any case we obtain an infinite sequence C , every 
curve of which contains at least one point of Si and contains only points of 
Si + MI + M . Let DI denote the sum of the interiors of these curves. 

Now suppose there is no point of 82 without DI . Then take a point P 2 of 
Sz that is within some curve C Pt of C and add squares to it in the manner in 
which we obtained K{ , K 2 , K 3 , , except that these squares with their 
interiors are subsets of S 2 + 1/2 + M and each one contains or encloses at 
least one point of S 2 , and add the extra condition that no one shall contain a 
point of C P2 or of its exterior. The sum of the interiors of the curves so 
obtained will be a domain D 2 , a subset of the interior of C Pt and therefore not 
containing all points of Si . Since D 2 would serve as well for the argument as 
DI we shall suppose that not every point of S 2 is within DI . 

Then since DI does not contain all of S it must have some boundary points; 
let B denote the boundary of DI. Suppose B contains a point PJ of Si. 
There is a square R of the sequence K Pl which encloses or contains P{ but 
no point or limit point of S 2 and which encloses a point of a curve of the se 
quence C . Consider the first curve of C which was obtained by adding a 
square having an interval in common with R or its interior. Let R* be the 
square so added. Consider two cases. 

Case I. It is given that R* has an interval in common with R. If the 
interiors of these squares are mutually exclusive, R possesses all of the proper 
ties necessary for it to be added in obtaining some curve of C , and it will 

j Cf. M. Sierpinski, loc. cit. 
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subsequently be added or enclosed by the addition of some other square of K Pl . 
If the interiors of R* and R are not mutually exclusive, evidently the interior 
of R must include that of R* . But R precedes R* in the sequence K Pl and 
would have been used instead of R* , since R contains an interval of R* that 
R* has in common with that curve of C to which we supposed it added. 

Case II. It is given that R* has an interval in common with the interior 
of R . In this case the interior of R* lies wholly within R but obviously they 
must also have an interval in common, since R* was the square added to 
obtain the first curve of C having an interval in common with R or its interior. 
The argument is the same then as in Case I. We have therefore proved 
that B contains no point of Si . 

Furthermore B contains no point of S% . For suppose P 2 is a point of 82 
belonging to B. It is not on any curve of C; therefore it must be a limit 
point of an infinite number of curves of C . Since P 2 is not a limit point of Si 
there exists a square Kh with P 2 as center, which neither contains nor encloses 
a point of Si . Suppose a side of Kh is t\ inches long. There exists only a 
finite number of squares of K Pl of side equal to or greater than fi/8 that have 
points in common with Kh or its interior. Let Q denote this set of squares. 
If any square of Q was used in the sequence K( , K 2 , K 3 , , let K n denote 
the last square in this sequence that belongs to Q . Then C n will be the last 
curve of C formed by adding a square of Q . If no square of Q was so used, 
let C n denote any curve of C . Then Kh encloses a point PI which lies on no 
curve of the set Ci , C 2 , , C n , but which does lie on a curve of C following 
C n and such that the distance from P 2 to PI is less than ei/4 . Then PI must 
be a point on a square of the sequence K{ , K 2 , K z , of side less than ei/8 
and therefore is at a distance of less than ei/4 from a point of Si , since every 
square of the sequence K( , K 2 , K 3 , contains or encloses a point of Si . 
This point of Si would then lie within K h , which leads to a contradiction. 
We have therefore proved that B is a subset of M . 

We can now prove that two closed, mutually exclusive point sets neither 
of which disconnects &lt;S cannot together disconnect S.f For suppose 
M = MI + 1/2 . We have shown above that not every point of S 2 is in D\ 
and that no point of S% is on B . Let P denote a point of S* without Di . 
There is a simple continuous arc from P to Pi . Let BI denote the first point 
of B on this arc in the order P PI . Suppose that MI is that one of the sets 
MI and MZ to which BI belongs. Since MI is a closed set not disconnecting 
S , PO can, by the lemma, be joined to PI by an arc not containing any point 
of MI . Let B 2 be the first point of B on this arc J in the order P Pi . Then 

t Hausdorff gives a proof for the case when one of the sets is bounded. Cf . F. Hausdorff , 
Grundzuge der Mengenkhre, Leipzig, Veit, 1914, p. 342. 

J Hereafter in this paper, "arc" and "simple continuous arc" will be considered synonymous 
terms. 
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B 2 belongs to M 2 . The set P #1 + Po B 2 contains as a subset an arc BI B z . 
Let #1 be a simple closed curve enclosing #1 but neither containing nor 
enclosing any point of M 2 (Fig. 3) and containing only one point LI of BI B 2 , 




and let H 2 be a simple closed curve neither containing nor enclosing any point 
of HI or MI , but enclosing B 2 and containing only one point LI of BI B 2 . 
Let a point of D\ within H\ be joined to a similar point within H% by an arc 
lying within D\ . There will be a subset of this arc, an arc RI R 2 , such that 
RI lies on Hi , R 2 on H 2 and all other points of RI R 2 lie without both HI and 
Hz . Then there is a simple closed curve J\ composed of LI L 2 + RI RZ 
together with either arc LI RI on H\ and either arc L 2 ^2 on H 2 . It is 
evident that the points of B on or within Ji that belong to M i can be enclosed 
in a finite number of circles no one of which contains or encloses a point of 
LI L 2 + L 2 R 2 + RI #2 + M 2 . And similarly those points of B on or within 
Ji that belong to M 2 can be enclosed in a finite number of circles no one of 
which contains or encloses a point of LI L 2 + L\ R\ + RI R 2 + MI or a point 
on or within a circle of the first set . Then, clearly, a point on LI L* , and 
therefore without D\ , could be joined to a point on RI R 2 , and therefore with 
in DI , by an arc lying within J\ and without both these sets of circles, and 
therefore not containing a point of B . Since this leads to a contradiction we 
have shown that if M consists of only two sets it cannot disconnect S . This 
result can evidently be extended to the case where M consists of any finite 
number of sets. 

Consider the arcs Po #1 and P 5 2 above. Let A 2 denote the first point 
of B 2 Po on PO BI . If A 2 is different from P it is evident that a point on 
B 2 A 2 very near A 2 can be joined to P by an arc lying without DI and con- 
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taining only P in common with P BI . From this together with P 2 A 2 we 
obtain an arc P B 2 which lies, except for P. 2 , without DI and has only P 
in common with P BI . Let M Bl denote that set of M to which BI belongs, 
M Bl the set to which P. 2 belongs. Since M Bl + M B2 does not disconnect S , 
P can be joined to PI by an arc not containing any point of M Bl or M Bt . 
Let P&gt; 3 denote the first point of B on this arc in the order P PI , and let M m 
denote that set of the collection M to which P 3 belongs. As above, let A 3 
denote the first point P&gt; 3 P has on the sum of the arcs PO PH and P BI . By 
taking a point on B 3 A 3 very near A 3 , and drawing a suitable arc to P , 
we obtain an arc P .63 which has only PO in common with P P&gt;i or P P2 , 
and which lies, except for P&gt; 3 , without DI . Similarly obtain the arc PO BI 
where B 4 belongs to B and to M B4 , such that P PM has only P in common 
with P BI , P B 2 , or P B 3 , and lies, except for P&gt; 4 , without DI . Now it is 
evident that the sum of two of these arcs is an arc crossing the sum of the 
other two. Suppose for instance that the arc P BI + P B 3 crosses the arc 
P P2 + PO B 4 . Let HZ be a simple closed curve (Fig. 4) enclosing BI but 




FIG. 4. 

neither containing nor enclosing any point of PoB 2 , PoB 3 , P P&gt;4, M B2 , 
M B3 , or M Bt and such that it contains only one point L 3 of the arc P Pn . 
Let Hi be a simple closed curve enclosing B 3 , containing only one point L 4 
of the arc P P 3 , and neither containing nor enclosing any point of H 3 , PO BI, 
P P2 , PO P4 , M Bl , M Bl , or M Bt . Now consider the first curve of C, C A , 
which has points within both H 3 and H . There is an arc R 3 B 4 , a subset of 
C A , such that R 3 is on H 3 , PM is on H and all other points of R s PM lie with 
out both H s and H . Select one of the arcs L 3 R 3 on H 3 and one of the arcs 
Li Pi 4 on Hi and let those selected be denoted by L 3 R 3 and Z 4 P 4 throughout 
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the discussion. Then L 3 L t + L 4 Pi 4 + Pi 3 # 4 + L 3 R 3 is a simple closed curve 
J 2 Now let R 3 R 4 be an arc on C^ lying on J 2 or within J 2 except for its end 
points, and having R 3 on H 3 and R t on # 4 and all other points_ without _both 
H^ and H , such that if we consider the simple closed curve J 2 which R 3 R* 
forms with that arc of J 2 (from #3 to # 4 ) that contains L 3 L t , there is no arc 
of C A (except R 3 R t ) lying on J 2 or within J 2 except for its end points, and having 
one point on H 3 and one on HI . It is possible that the arc Rs RI will_ be the arc 
# 3 Pt 4 Now it is evident that either P -B 2 * or P #4 will lie within J 2 . Suppose 

P B 2 does. Then B z will lie on or within J 2 . The point B z is a limit point 

of an infinite sequence of curves of C following C A . Let PI , P 2 , Pa , 
denote a sequence of points on successive curves of this sequence such that 
B z is a sequential limit point of the set PI , P 2 , P 3 , . If #2 is on ^2 these 
points may all coincide with 5 2 ; if not, let them be chosen so that they lie 
within Jz 

Denote by Ci that curve of C on which P { lies, and let Ni be the last point 
of d starting from P t - in either order around C such that P, N lies onjor 
within J 2 . It will be seen that every Ni will have to be on L 3 Rs or L.\ Pi 4 , 
for if any Nf is on R 3 RI there must be a point F of d very near Ni , without 

J 2 and therefore on the opposite side of C A from P B z , for if B 2 is not on 

C A it can be joined, because of the condition put upon R 3 R^ , to any point P 
on #3 RI by an arc having only the point P on C A . Now since PO B 2 is 

without C A , F must be within C A . This is impossible, since P is on a curve 
following C A in C, and is therefore on or without C A . Now the set of arcs 
Pi Ni (where i = 1,2,3, ) determines a limit set Y such that Y is a closed, 
connected set, every point of which is on or within J 2 . Since P. 2 belongs to 
Y , Y must be a subset of M Bl . But there will be a point of Y on either 
L z R 3 or L 4 R* , namely a limit point of the set of points NI, N 2 , N 3 , 
This leads to a contradiction, for neither H 3 nor H^ contains a point of J/ B! . 
The supposition that M disconnects S is therefore proved false. 

THEOREM 4. If MI and J/ 2 are two closed, connected, bounded point sets, 
neither of which disconnects a plane S , unnecessary and sufficient condition that 
their sum, M , shall disconnect S is that M , the set of points common to M\. and 
Jtf 2 , be not connected. 

Proof. The condition is necessary. For suppose that M is connected; 
we can prove that M does not disconnect S . For suppose S J/ = Si + &lt;S 2 
where &lt;Si contains no limit point of *S 2 nor S 2 of Si. Let S be a square 
enclosing M. Then it is evident that either Si or S 2 must lie within S. 
Suppose Si does. Let Pi be a point of Si, and consider all points which lie 

* If Po 5 2 is an arc, Pt&gt; 5 2 denotes the point set P B 2 - P - 2 . 
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with PI in a connected subset of Si . It will be seen that since M is closed, 
these points form a domain D s , a subset of Si , and of the interior of S , such 
that the_ boundary of D s is a subset of M_ . Let M denote the point set 
M! -- 1/2 and M" the point set J/ 2 - M . Let P 2 denote a point of S 2 
that is without S; by the lemma, P 2 can be joined to PI by an arc not con 
taining any point of M 2 . Let P denote the first point the arc P 2 PI has on 




the boundary of D s (Fig. 5), let P 2 be joined to PI by an arc not containing 
any point of MI , and let P" denote the first point of this arc P 2 PI on the 
boundary of D, . Then P belongs to M and P" to M" . Then there exists 
an arc P P 3 P" , a subset of P 2 P + P 2 P" , which lies, except for its end 
points, without D s . As in Theorem 3, there exists a simple closed curve H i 
enclosing P , containing only one point LI of the arc P P 3 P" , and neither 
containing nor enclosing a point of J/ 2 , and there exists a corresponding 
curve HI enclosing P" , containing only one point L 2 of the arc P P 3 P" and 
neither containing nor enclosing a point of MI or HI , and there exists an arc C\ R\ 
in D s having only C\ on HI and RI on H z . Then there is a simple closed curve 
/i , composed of LI P 3 L 2 +_ Ci RI together with the arcs LI C\ on HI and 
L 2 RI on H 2 so chosen that Ji encloses P and P" . 

We can show as in Theorem 3 that if those points of the boundary of D s 
that this curve contains or encloses belong to M + M" then two points, 
one without D s and the other within D s , can be joined by an arc lying within 
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Ji and containing no point of the boundary of Ds . Since this is impossible J\ 
must enclose some point of M , and since M is connected and since no point 
of M is on Ji , M must lie wholly within Jj . 

A ray of a straight line can be drawn from P 2 , lying wholly without S , 
and there exists a ray r\ , a subset of the first ray together with one of the arcs 
PI PI , such that ri lies without J\ except for its end point P a on P P 3 P" . 
Let S denote a square enclosing S and both arcs PI PI and containing only 
one point P r of r\ . If L is a point on HI and J\ sufficiently near LI there 
exists an arc P c L such that P c is a point on S and such that P c L lies without 
D s , P c L - L lies without Ji and P c L - P c lies within S . Similarly a 
point L" of H 2 and Ji can be joined by an arc to the point P D of S , and in 
such a way that the arc P D L" contains no point of the arc P c L . Let L L" 
denote the arc composed of P c L + P D L" together with that arc of S , 
from P c to P D , which does not contain P T The arc L L" together with 
the arc L C\ R\ L" of J\ makes a simple closed curve J 2 whose interior has no 
point in common with that of J\ . Since J 2 encloses points of D s we can show 
as above that it must^contain all of M . Since this is obviously impossible, 
the supposition that M was connected is proved to be false. 

The condition is sufficient. For suppsoe M is not connected. Then it is 
the sum of two closed, mutually exclusive point sets, MI and If 2 . We can 
show that in this case S M = Si + S 2 where Si and S 2 are mutually exclu 
sive and neither contains a limit point of the other. 

The point set MI contains some point PI which is a limit point of M" since 
MZ is connected and MI and M 2 are closed point sets. Let J Pl denote a 
circle enclosing PI but neither containing_nor enclosing any point of M 2 . 
This circle contains a point Pf of M 2 MI M 2 , and there is a ray of an 
open curve from Pf not containing any point of MI , since M i is closed and 
bounded and does not disconnect S . And a subset of this ray is a ray r 2 , 
having an end point on J Pl and lying, except for this end point, without J Pl . 
Now, by the Heine-Borel theorem, M 2 can be enclosed in a finite set of circles 
no one of which contains or encloses any point of MI or J Pl or r 2 . Consider 
any one of these circles together with all of the set that are connected with it. 
By Theorem 42 of The foundations] there exists a simple closed curve J C2 , a 
subset of these circles, wjipse interior contains all of their interiors. It will 
not contain any point of M , but will enclose some points of M; let M* denote 
the set of these points. The curve J Pl with r 2 will be without J Cj , . Now, as 
above, there exists a ray r s which has its end point on J Cj , lies, except for this 
point, without J C2 and contains no point of M x . Then all points of If that 
are without J C2 can be enclosed in a finite set of circles no one of which contains 
or encloses any point of J Ct or r 3 . If these circles do not form a connected 

t Cf. R. L. Moore, loc. cit, p. 156. 
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point set they can be joined by a finite number of arcs not containing any 
point of J Cj + TZ . For J C2 with its interior does not disconnect S , and r s 
does not disconnect S and they have only one point in common; therefore, 
by the first part of this theorem, J Ct with its interior and r 3 does not disconnect 
S . These arcs can be covered by a finite set of circles not containing any 
point of J f2 or r 3 , and hence from all these circles a simple closed curve J Cl 
can be obtained which has the following properties: it is wholly without J C2 
and does not enclose it; it encloses all points Iff, of M, that J C2 does not 
enclose ; therefore it contains no point of M . 

The points of 3/ 2 that are on J Cl or J&lt;- 2 or without both J Cl and J C2 form a 
closed set, a -subset of M" . This set can be covered by a finite set of circles, 
T , such that no circle of T contains or encloses any point of 3/i . Xow there 
is a ray of an open curve, not containing any point of 3/i , from some point 
within each of these circles. Let [ r t ] denote the sum of such rays. And 
let MI be covered by a finite set of circles no one of which contains or encloses 
any point of the circles T , or of the point set [r t ] + r-i + r 3 . Since 3/i is 
connected this set of circles will be connected, and, as above, there exists a 
simple closed curve J\ which is a subset of them and encloses all of their in 
teriors and therefore all of MI . The curve Ji cannot wholly enclose either 
J Cl or J C2 since it contains no point of r 2 or r 3 . 

Let S* be a square enclosing -7 Cl , J C2 , Ji and 3/ 2 . It is evident that there 
is an arc P 3 P 4 composed of a finite number of straight line intervals lying 
within S* , except for the points P 3 and P 4 which are on S* , and which 
separates the interior of S* into two parts such that J Cl lies wholly within one 
part and J C2 within the other. Let A be a point of M within J Cl and B a point 
of M within J C2 . By Theorem 1 of this paper there is a connected subset of 
3/2 lying within J\ such that A and some point A of Ji are limit points of it. 
Moreover this set will lie within J Cl since it is connected, A lies within J Cl 
and Jj encloses no point of M 2 that is on J Cl . Similarly there exists a con 
nected subset of 3/2 lying within J\ and J C2 such that B and some point B 
on J\ are limit points of it. 

Now there is a subset, CD , of P 3 P 4 which satisfies the following conditions: 
it is an arc lying except for C and D within J\; C and D are on J\ and separate 
A and B on J\ . For consider all arcs of P 3 P 4 which lie except for their 
end points within /i and have their end points on J\ . There are a finite 
number of these, since J\ is composed of a finite number of arcs of circles 
and P 3 P 4 of a finite number of straight line intervals. Let LI , L 2 , L 3 , , 
denote the set of all such arcs and let L denote the point set LI + L* + // 3 . 
Suppose that no L } separates A and B on J\ . Denote by a and a," the two 
arcs A B on J\. If any arc L,- of L has an end point on a for instance let 
Xi denote the first such end point on a in the order A B and X 2 the other 



158 



ANNA M. MULLIKIX 



^September 



end point of this arc. Then consider the simple closed curve A"i A + a" 
+ B X-z + X-z Xi . It will be seen that A and B can be joined by an arc 
lying on or within this curve which contains no point of Li. By a finite 
number of repetitions of this process it is evident that we can obtain a simple 
closed curve C x , a subset of Jj plus its interior, containing A and B but 
enclosing no point of L . A and B could then be joined by an arc lying on 
or within C x and containing no point of L and hence no point of P 3 PI . Since 
this is impossible there must exist an arc CD , having the properties stated 
above. 

Since CD , being a subset of P 3 P 4 , lies without both ,/ Cl and J c , and since 

it lies within J\ , it contains no point of MI . Therefore of the two regions into 
which it divides the interior of Ji,* one must contain A, the other B . Let 
R denote the interior of Ji . Let F\ F 2 F s be an arc as indicated in Fig. 6 




with its end points on J t and lying, except for its end points, without Ji and 
wholly without J Cl and J C2 and such that the simple closed curve FI CF 3 F* F! 
neither contains nor encloses a point of M ! or J/ 2 . Let Ci be a point within 
this curve. Let F( F z F 3 be a similar arc forming part of a simple closed 
curve F i DF 3 F 2 F{ and let DI be a point within it. Now if we suppose that 
M does not disconnect S , let NI be the first point of the arc CD on MI . Then 
by the first part of this theorem, CWi + M does not disconnect S . Let A T 2 
be the first point the arc DC has on M l ( NI and N 2 must exist since J/i is con- 
* Cf. R. L. Moore, loc. cit., p. 141. 
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nected and CD divides R into two parts, of which one contains A , and the 
other contains B ) . By several applications of the first part of this theorem, 
we can show that M + CN^. + DN* + t\ CF 3 +_F{ DF Z does not disconnect 
S . Let H denote this point set. Then C\ and Z&gt;i can be joined by a broken 
line containing no point of H . By extending this broken line we get a broken 
line CED which has only C and D on H . By Theorem 43 of The foundations^ 
there is a simple closed curve C , a subset of Ji together with the broken line 
CED whose interior is a subset of R , which encloses NI but no point of the arc 
CED. Then C will enclose all of J/j + (CN^- C) + (Z)iV 2 _- Z&gt;), since 
this is a connected point set and has no point on C . The curve C will contain 
FI CF 3 , F{ DF 3 , A and B since it encloses A and B and contains no point 
of the previously described connected sets of M 2 between A and A , and B 
and B . Then it is evident that there is a curve C , which has all the above 
mentioned properties of C except that it contains only the points C and D 
of the broken line CED and which is such that (1) it is a subset of C plus its 
interior, (2) those points of C that are not on C do not belong to M . 

It is possible that the broken line segment CD C - D (a subset of 
Pz P] will not lie within C . Consider the arcs of the broken line CD which 
lie except for their end points without C and have their end points on 6". 
Let C D denote such an arc. C and D are points of R. Let E denote a 
point of C D without C . The points C and D can be joined by an arc 
C F D such that C F D lies within I" and therefore within J x . Then the 
simple closed curve C E D F C is a subset of R , and therefore the arc C GD 
of C that it encloses must be a subset of R, and therefore it must belong to 
that part of C which does not belong to C and which therefore contains no 
points of M . Now there are only a finite number of arcs such as C E D , that 
are parts of the broken line CD , which lie except for their end points without 
C and have their end points on C , and we see that each one can be replaced 
by an arc lying on C and containing no point of M z This gives us a con 
tinuous curve from C to D lying on or within C and containing no point of 
M-2 , and there is an arc, a subset of this curve, having the same properties. 
It is evident that this arc can be replaced by an arc CE* D lying within C 
except for C and D and containing no point of il/ 2 . Now C and D separate 
A and B on C . For if they did not, A and B could be joined by an arc 
ai lying, except for A and B , within C , and C and D could be joined by a 
similar arc a 2 such that a 2 had no point in common with a\ . But both these 
arcs would lie except for their end points within C and J\ and have their end 
points on C and J\ . This leads to a contradiction, since C and D separate 
A and B on J\ . 

Now the arc CED , together with the arc CE* D , forms a simple closed 

t See R. L. Moore, loc. cit., p. 157. 
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curve Js containing no point of M , but such that one of the points A and B 
is within it, and the other without it. This is impossible since If 2 is a con 
nected point set. Therefore the supposition that Ci and DI can be joined 
by an arc containing no point of H is false and since H disconnects S , M 
must disconnect S . Evidently one of the two sets into which M separates S 
is the point set, Si , composed of C\ together with all points that can be joined 
to it by arcs containing no point of If . The set S 2 will then be the point set 
S - M - St . 

THEOREM 5.* If MI and If 2 are tivo dosed, bounded, connected point sets in 
a plane S , such that neither MI nor MZ disconnects S and such that M\ and MZ 
have in common only KI and KZ , where KI and KZ are mutually exclusive con 
nected sets, then S MI -- MZ is the sum of just two unit null i/ exclusive, connected 
domains. 

Proof. We have shown above that under the conditions of this theorem 
S MI MZ is not connected. Suppose then that it is the sum of more 
than two mutually exclusive connected domains. There will exist three points, 
PI, P 2 and P 3 , no two of which can be joined by an arc containing no point 
of M i + MZ . It is evident that in the preceding theorem the curve J : 
could have been constructed in such a way that PI , P 2 and P 3 were without it; 
for there exist three open curves, containing PI , P 2 and P 3 but containing no 
point of ML , and J\ could have been drawn so as not to contain any point of 
these open curves. We shall suppose that J\ has been so drawn. We can 
furthermore suppose that Ji is replaced by a polygon IF, satisfying the condi 
tions which J\ satisfies. 

Now PI and P 2 can be joined by an arc made up of a finite number of straight 
line intervals containing no point of MZ , since M z does not disconnect S ; 
and since M 2 + PI P 2 does not disconnect S , it is obvious that P 2 and P 3 
can be joined by a similar arc which contains no point of MZ and has only 
P 2 in common with PI P 2 . Similarly there is a broken line, an arc P 3 PI , 
containing no point of M z , and having only P 3 and PI in common with 
PI P 2 -+- P 2 P 3 . Now any one of these three arcs will contain only a finite 
number of intervals lying except for their end points in W and having their 
end points on W . These intervals will be of two kinds, those whose end 
points separate A and B on W , and those whose end points do not separate 
A!_ and B on W . Consider an interval Xi X 2 of the second sort. Let 
X i X Xz denote that arc on W which does not contain A or B . Then 
obviously X\ Xz + Xi X Xz is a simple closed curve enclosing no point of 
KI or KZ , and by the method employed in Theorem 3, we can draw an arc 
Xi X z such that Xi Xz is within this curve and contains no point of either 



* Rosen thai gives a proof for the case in which each of the sets KI and K 2 reduces to a 
single point. See A. Rosenthal, loc. cit., p. 102, Theorem 6. 
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MI or Mz . Let the original interval X\ Xz be replaced by this arc X\ X 2 
and let this process be carried out for every such interval, of the second kind, 
on each of the three arcs. Now each one of the three must have at least 
one interval of the first kind; for since no one of these arcs has any point of 
MI on it and all of MI lies within W and every interval of the second sort of 
the arcs lying within W has been replaced by an arc containing no point of 
either A/\ or M 2 , if one of these arcs, PI PI , for instance, had no interval of 
the first sort on it, it would be replaced by an arc PI P 2 which had no point 
of MI or MZ on it, which is contrary to our supposition. Let FI F 2 be the 
first interval on PI PZ in the order PI PZ which separates A and B on W , 
let Y 3 F 4 be a similar arc on P 2 PZ , and let F 5 F 6 be a similar arc on PS PI . 
Some two of the points YI , F 3 , F 5 must lie on the same arc A B of W; sup 
pose YI and F 3 do (Fig. 7) . Now consider the simple closed curve 




FIG. 7. 

F = FI F 3 F 4 Yz YL . Since it contains neither A nor B it encloses no point 
of either K\ or KI . Now there is no connected subset of MZ lying on or within 
T and having a point X 3 on FI F 3 and a point X 4 on F 2 F 4 . For such a set 
would separate A and B on W and would divide the interior of W into two 
sets of which one contains KI and the other contains K 2 . But KI and K 
are connected within W by a set of points belonging to MI . This set would 
have to cross the subset of M 2 lying within F . But this is obviously impos 
sible. Then by Theorem 2 there is a division of the points of l/ 2 that lie on 
or within F into two mutually exclusive closed sets, Zi and Z 2 , such that Z\ 
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contains all the points of M-i that lie on FI Y 3 . Now all points of Zi can be 
enclosed in a finite number of circles not containing any point of F 3 F 4 + 7 4 F 2 
+ F 2 Fj_or any point of Z 2 or MI, and evidently there is an arc lying on or 
within Y, a subset of these circles together with intervals of FI F 3 , which 
contains no point of either MI or M 2 Then PI and P 2 are joined by an arc 
not containing any point of either MI or M 2 . Since this is contrary to our 
supposition the theorem is proved. 

UNIVERSITY OF TEXAS, 
AUSTIN, TEX. 



ON THE LOCATION OF THE ROOTS OF CERTAIN TYPES 

OF POLYNOMIALS* 



BY 

J. L. WALSH 



When we study the dependence of a variable on k other variables which 
vary independently, our problem may be very much simplified if we can con 
sider all or some of these independent variables to coincide and thus study the 
dependence of our original dependent variable on one new variable or at least 
on a number of new independent variables less than k . The present writer 
has recently published a theorem (Theorem II, below) which enables us to 
make a reduction of this sort in the study of the relations between the roots of 
certain types of polynomials. The present paper aims to prove Theorem I 
(below), which is a much more general result of the same nature, and to indicate 
various applications of that theorem. The applications given are extremely 
simple and follow from Theorem I with practically no further machinery, f 
The most interesting application is Theorem VI. 

Our problem is, more explicitly, to study the geometric relationship of the 
roots of a polynomial 

/(z) = (z oi) (2 a 2 ) (z a m ) 
to the roots of a related polynomial 



* Presented to the Society, December 29, 1920, and April 23, 1921. 

t When this paper was first offered for publication, the writer believed Theorem I to be 
new. Professor D. R. Curtiss has kindly pointed out its connection with a theorem due to 
Grace and has indicated an entirely new proof of Theorem I; the reader will refer to Professor 
Curtiss s note which immediately follows this paper. 

The point of view of the present paper in the proof and application of Theorem I seems to 
be new, and also the results obtained except where otherwise stated. 

This paper is the development of a short note published in the Paris Comptes 
Rendus, March, 1921, to which explicit reference is made later, and which contained in 
outline the proof of Theorem I. In the interval between the publication of that note and the 
publication of the present paper, there have appeared a number of other papers dealing with 
Grace s Theorem. See Szego, Mathematische Zeitschrift , vol. 13 (1922), pp. 28-55; 
Cohn, Mathematische Zeitschrift, vol. 14 (1922), pp. 110-148; Egervary, Acta 
Litterarum ac Scientiarum, Regiae Universitatis Hungaricae Fran 
cisco- Josephinae, vol. 1 (1922), pp. 39-45; Fekete, same Journal, vol. 1 (1923), pp. 
98-100. 

The present paper has very little in common with these other papers, but with Szego s 
Theorem 9 the reader should compare our Theorem X, and with Szego s Theorems 13-15 
compare our Theorem IV and also Walsh, American Mathematical Monthly, vol. 

29 (1922), pp. 112-114. 
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whose roots are supposed to be known.* When we study the functional 
dependence of a root of/, say ai for definiteness, on the b i} it may occur that 
without changing ai we can replace the n roots 6 Z - by n roots of a polynomial 
(f&gt; which coincide at some point b which bears a simple relation to the roots &, . 
Then we can study the dependence of ai on b instead of on &i , b 2 , , b n , 
which is frequently a simplification of our problem. 

To the plane of the complex variable we adjoin the point at infinity; infinity 
is to be considered simply as an ordinary value of the variable. As a geomet 
rical consequence we shall consider the term circle to include the possibility of 
a straight line. We shall have occasion to deal with circular regions, by which 
we mean a closed portion of the plane bounded by a single circle, that is, the 
interior of a circle, the exterior of a circle (including the point of infinity), a 
half-plane, a point, or the entire plane; the points of the boundary are always 
to be included in the region. 

I. A GENERAL THEOREM 

We proceed to the proof of our main result: 

THEOREM I. Letf(z) be a polynomial in z whose coefficients are polynomials 
linear in and symmetric in each of the sets of variables 

{!, 2, , Oik] , {01,02, ",/3,}, , {Xi, X 2 , , X 4 } .| 

Let these points {ai} , {/?ij , , {A;} lie in circular regions C a , Cp, , C\. 
Then for any fixed values of these variables and of z we can always make all the 
{ai} coincide in C a , all the {Pi} coincide in C ft , etc., without altering the value 
off(z). 

The theorem also obtains if we replace/ (2) by the quotient of two poly 
nomials of the type described, except that we are to consider the conclusion of 
the theorem satisfied if these two polynomials vanish simultaneously. J Of 
course if we have two polynomials of the type required, their sum is also of 
that type. 

We shall prove the theorem considering the {&}, {7^}, , {A,-} always as 
fixed and showing that we can make the {;} coincide in C a as stated. Then 
we can consider the [at] fixed and coincident, and the {7;}, , {X t -} fixed; 
our former reasoning will show that the {&} can be made to coincide. Con 
tinued reasoning in this manner will evidently complete the proof of the 
theorem. It will be convenient to assume that the value of /(z) considered 

* Professor Curtiss has recently published a very interesting report on this general field, 
Science, vol. 55 (1922), pp. 189-194. 

t The coefficients off(z) need not be homogeneous in each of these sets of variables, but 
each coefficient must be a linear combination of the elementary symmetric functions of each 
of these sets with coefficients linear combinations of the elementary symmetric functions of 
the other sets. These linear combinations may, moreover, contain constant terms. 

t This is what actually occurs in the situation of Theorem II if we choose P inside C . 
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in the hypothesis is zero; this involves no loss of generality, for the addition 
of a constant term to/(z) does not alter the properties required. 

We now wish to show that the a can all be made to coincide. If the region 
C a is a point, the statement is trivial. If we prove the theorem where C a is a 
circular region not the whole plane we have proved it where C a is the whole 
plane. 

Consider the a; to vary independently and to have the region C a as their 
common locus. Then the relation 

(1) /00 = 

defines z as an analytic function of the a; and hence z will have a certain locus 
Z; this locus Z will be a closed point set since C a is closed. If (1) degenerates 
and does not effectively contain z at all, we introduce an auxiliary variable f 
by placing 



the new function F ( f ) surely contains f and we may reason with it as before 
far/(a). 

Let a be any point interior to C a . There is a certain locus Z of points z 
corresponding to the null-circle a as the locus of the {at}. Make use of the 
auxiliary circle C of the coaxial family determined by a and C a . Let C com 
mence with the position a and gradually enlarge and coincide with C a , the 
region bounded by C and containing a always considered as the locus of the 
{}. The locus Z also grows larger and varies continuously with C (except 
as noted below), for the roots z of (1) are continuous functions of the { &lt;*}. 
When the region C continues to enlarge, beyond C a if necessary, the locus Z 
eventually becomes the entire plane; this happens ordinarily before C has 
swept through the entire plane. The point z with which we start is a point 
of the locus Z surely when C coincides with C a and possibly before. 

The case that the locus Z does not vary continuously with C occurs only if, 
for some choice of the {cti}, (1) vanishes identically in z. When this occurs, 
the roots {z} of (1) do not vary continuously with the {;}. The locus Z may 
enlarge suddenly and become the entire plane while its boundary does not 
necessarily sweep over the whole plane. 

Whether or not this phenomenon takes place, corresponding to our original 
point z there is some circle C such that for no circle C , the region C smaller 
than and entirely contained in the region C , can z be a point of the locus Z 
of the roots of (1). This statement and in fact Theorem I as well are true if 
z is a point of the locus corresponding to a. The statement is true in any 
other case. For if there is a sequence of sets of points \a.i\ in a sequence of 
regions each contained in the preceding, there is at least one limiting set of 
points {a} contained in all the regions and hence in the limit region. The 
relation (1) obtains for this limit set. 
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Our entire proof of Theorem I rests on the remark that when we fix z and 
all but two of the points {on}, equation (1) becomes a homographic and 
involutory relation between the other two of these points and hence when one 
of these points traces a circle the other also traces a circle. The proof is com 
plicated, however, by the possibility that this relation may degenerate and 
may effectively contain but one or neither of these variables.* 

We prove now a very special case of Theorem I, namely, that if a circular 
region C contains two points ai and a 2 connected by a relation of the form 

(2) Cti Oil Oil 4- 2 ( 1 + &lt;*2 ) + 3 = , 

then C contains a root a! of the equation 

(3) ai a" + 2a 2 a! + a 3 = . 

Transform (2) by an auxiliary linear transformation so that the point at 
infinity is a double point of the transformation (ai, 2 ) defined by (2). The 
line through a\ and 2 is transformed into itself by the transformation ( ai , 2 ) , 
for the three points ( a\ , &lt;x 2 , ) go into ( 2 , a\ , o ) . When a\ is moved on 
this line, a 2 moves on the line in the opposite sense, for if the two points move 
always in the same sense, when a\ is moved along the finite segment from its 
original position to the original position of a 2 , a 2 has moved in the same sense, 
has at no time coincided with the point at infinity and hence has not reached 
the original position of a\ . The points a-i and 0:2 moving on the line joining 
them in opposite senses must have a double point D between them. It is 
thus evident that the region C which contains i and 2 must contain D or the 
point at infinity and hence a solution of (3). 

The reasoning just used supposes implicitly that the relation (2) does not 
degenerate. If (2) does not contain a-&gt; for a particular value of a\ , that value 
of ai is a solution of (3). If (2) is satisfied identically, any values ai and a 2 
suffice. 

We now return to the proof of Theorem I in its general form. Let C be the 

* Thus we may start with the auxiliary polynomial 

F(a) = (a - i)(a- ) (3 - at) k&gt;3 

and consider for the polynomial of Theorem I 

/()-P (). 

The relation 

(! ) /U)=0 

if ctk-i = ai c = z contains effectively none of the variables i , a?. , , ak-z If at = z , 
we have ( 1 ) reducing to 

(Z l) (Z - 0-2) (Z ajfc_i) = 0, 

and hence we have no effective homographic relationship between &lt;**_! and a* . 

This example is not an unnatural one ; indeed it is one of the first to which we should think 
of applying Theorem I ; compare Theorem II and its applications to which a reference is given. 
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smallest region of the kind already described for which z is a point of the 
locus Z . There is at least one of the points on on C . We shall show that all 
those points can be made to coincide on C . 

Two points ai and a 2 which are on C can be made to coincide either on C 
or within C. For we fix z and the remainder of the points {en} and hence 
have a relation of type (2) between i and a 2 . A double point of the trans 
formation is either on the circle C or interior to the region C . 

Let us combine in this manner as many of the points {} on C as possible, 
so as to leave the smallest number of points { t j actually on C . This number 
of points we denote by n; it is greater than zero and cannot be reduced. 

We shall prove by induction that all these n points on C can be made to 
coalesce on C . This fact is evidently true for two points, for no double point 
of the corresponding transformation can be interior to C. We assume ex 
plicitly that whenever there are given n 1 points of this sort on an arc A 
of C, they can be made to coincide on A without changing any other point 
at, or the relation (1); we shall prove this fact for n points. It involves no 
loss of generality to suppose n - - 1 of these points at an end point X of A and 
the nth point at the other end point Y . 

We consider one of the points a t at X and the point a n at Y as connected 
by (1), while z and the other {a^} remain fixed. When cti moves on C , ct n 
moves on C in the opposite sense, and the transformation (ai, a n ) has a 
double point D interior to A . Consider a\ and a n to coincide at D . The 
new arc bounded by X and D contains the n points. We can make coincide, 
at a point X , n -- 1 points ai, X on the arc XD , and the other point a n 
will be at D . Then we have the n points on an arc A bounded by X and Y , 
the arc A shorter than A and every point of A a point of A . Moreover, 
there are n 1 points at X , the end point nearest X , and the other is at 
Y = D , the end point nearest Y . 

We can continue in this manner successively to shorten our arc A , and we 
can shorten it indefinitely. For let us suppose we have a sequence of arcs 
(using the natural extension of our former notation), XY , X Y , X"Y" , , 
and suppose the points X , X , X" , have a limit point x and the points 
Y , Y , Y" , a limit point y different from x and of such a nature that we 
cannot make our arc of type A shorter than xy . We have n 1 coincident 
points di, successively approaching x, and one point a n simultaneously 
approaching y . From the continuity of the left-hand member of (1) we can 
therefore suppose these n -- 1 points on to coincide at x and the other point 
a n to lie at y . Then the procedure formerly used shows that we can bring 
the n points into an arc shorter than xy all of whose points are points of xy . 
This contradiction completes the proof of our statement that the arcs A , A , 
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A" , shorten indefinitely, and hence the points a\, at, , a n can all be 
made to coincide at the limit point P of those arcs, with (1) still satisfied. 

If k = n, Theorem I is proved. If k &gt; n, consider the transformation 
(OL\ , a k } defined as before. We have supposed that none of the points a\ , 0.1 , 
, a n can be moved from the circle C , so the transformation cannot contain 
dk and hence a* can be moved as near P as desired. Thus any other of the 
points atn+i , a n +2 , , ctk can be moved as near P as desired without changing 
the value of /(z) . Hence all these points can be made to approach P and so 
the value of /(z) is unchanged if all the a\, a 2 , , a.k coincide at P. 
Theorem I is now completely proved. 

II. SPECIAL CASES OF THEOREM I, WITH APPLICATIONS 
There are two results, essentially special cases of Theorem I, which are 

particularly interesting in their applications. 

THEOREM II. // the points on , a 2 , ,* lie in a circular region C and if 

z is exterior to C , the root of the equation in a. 

1 _,. 1 f h 1 _ & 

z ai z 2 z otic z a 

lies in C .* 

Theorem II is in reality a special case of the extension of Theorem I where 
we consider the quotient of two polynomials of the kind described. The 
denominator polynomial cannot vanish, since z is exterior to C . 

For applications of Theorem II we refer to the citation already made and 
that made in the next following footnote. We proceed to another special case 
of Theorem I : 

THEOREM III. // the points ai, az, , otk lie in a circular region C , the 
equation in a 
(4) (z - ai) (z - 2 ) (z - a fc ) = (z - a) fc 

has at least one root in C . 

Theorem III can be proved independently of Theorem I in a manner precisely 
analogous to the proof of Theorem II (loc. cit.). This proof of Theorem III 
involves a transcendental transformation of the a- (or (x, ?/)-) plane: 

a. z = x -f- iy = e u+iv , 

and a study of the transform of C and certain centers of gravity in the ( u, )- 
plane. This proof gives* in certain cases more detailed information than does 
Theorem I regarding the root a of (4). Thus if z is exterior to C, which is 

* Walsh, these^T ransactions, vol. 22 (1921), p. 102; Lemma I. 
Theorem II is closely connected with another more simple corollary of Theorem I, namely, 
that if fc equaljparticles lie in a circle their center of gravity also lies in that circle. 
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the interior of a circle, we can write 

k log (z a) = log (z ai) + log (z a 2 ) + + log (z a k ) , 

where all the logarithms involved have the coefficients of V 1 in their pure 
imaginary parts lying between two numbers whose difference is less than TT . 
We add the remark, without proof at this time, that there is no other finite 
region bounded by a regular curve which possesses the property of the finite 
circular region indicated by Theorem III.* 

Theorem III can easily be extended to give some information concerning 
the location of all the roots of equation (4) in a . In fact, if a is any root, the 
other roots are given by 

(z- a} k = (z- a } k , 
z a = u (z a } , 

where w is a kih root of unity. If we suppose that a is in C , all the roots of 

(4) must lie in the k circular regions obtained by revolving C about z as center 
of rotation through the angles 2jw/k ,j=l,2,---,k. In particular suppose 
one of these k circular regions, say C\ , is external to all the other k 1 circular 
regions. Then we can prove that C\ has on or within it precisely one root of 
(4). For consider the points i, 2 , , ctk to move continuously always 
remaining in C and to coincide at the center of C . In this situation C\ con 
tains precisely one root of (4). The roots of (4) vary continuously with the 
points 0:1,0:2, &gt;*; none can enter or leave C\ during motion of the kind 
indicated; so in the original situation Ci contained precisely one root of (4). 

However, our purpose is not primarily to study the roots of (4), but rather 
to use Theorems I and III as tools in proving more interesting relations. In 
preparation for these results we now prove the 

LEMMA. // the interiors and boundaries of the tivo circles C\ and C 2 , whose 
centers are a\ and a 2 and radii r\ and r 2 respectively, are the loci of two points zi 
and z-i , then as zi and z 2 vary independently, the locus of the point z which divides 
the segment (zi, z 2 ) in the constant ratio (TOI : m 2 ), 

(5) z = - ( TOI ^ m 2 , Wi m 2 ^ ) , 

is the interior and boundary of a circle C whose center is 

niz a.\ + mi ocz 

nil + m 2 
and whose radius is 



nil r-2 



t 



* For the corresponding fact for Theorem II, compare Walsh, these Transactions, 
vol. 24 (1922), pp. 31-69; Theorem III. 

f The terminology that z divides the segment (zi, z 2 ) in the ratio (mi : m 2 ) is usual when 
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If Zi and z 2 are on or within Ci and C 2 respectively, 
we have 



[October 



z 



+ 



+ i \ ! ***X / I i 

m 2 mi ~r m-2 ffii-r m 2 

which is in absolute value less than or equal to 



(z 2 



+ m 2 



-f- m 2 



It remains to be shown that given any point z on or within C , we can properly 
determine zi and z 2 In order to do this, we merely place 
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Then zi and z 2 lie in the proper regions and satisfy (5), so the proof of the 
lemma is complete. 

This lemma gives no result if mi + mz = 0; to treat this case we take (5) 
in the form 

(6) m 2 (z zi) = -- mi (2 z 2 ), 

which is equivalent for our present purpose. If Ci and C 2 are mutually ex 
ternal there is no point z different from the point at infinity which satisfies (6). 
If Ci and Cz are not mutually external we may choose zi = z 2 , and every 
point of the plane satisfies (6). In the theorems below we give the general 
formulas for the determination of C with the understanding that when 
mi + m 2 = 0, C is considered to contain no finite point of the plane or every 
finite point of the plane according as Ci and C 2 are or are not mutually 
external.* 

mi and m 2 are real; whether m t and m 2 are real or not, we simply understand that statement to 
mean that 2 is given by (5). 

This lemma was proved by the present writer and by the present method for mi and m 2 
real and positive, in a note in the Comptes Rendus du Congres international, Strasbourg, 1920. 
See also Theorem II of the reference given in connection with the present Theorem II. 

* Thus, a careful statement of Theorem IV for the case A = 1 is 

THEOREM IVa. // the poin s a lt a^, , a k have as their locus a circle Ci whose center is a 
and radius ri, and if the points bi, 6 2 , , b h have as their locus a circle C 2 whose center is p 
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We now apply Theorem III and the Lemma to prove 

THEOREM IV. // the points a^ , a 2 , , a k haw as their locus (the interior 
and boundary of) a circle C\ whose center is a and radius ri , and if the points 
bi , b% , , bk have as their locus a circle C% whose center is /? and radius rz , 
the roots of the equation 

(7) (z ai) (z a 2 ) (z a k ) A (z &0 (z 6 2 ) (z b k } = 
have as their locus the k circles with the respective radii 



and centers 

a - 






_ A iik 



where A llk takes all the k values possible. Any one of these k circles which is 
external to all the others contains precisely one root of (7) . 

It is evident that any polynomial equation can be written in the form (7), 
and in an infinite variety of ways. 

We prove Theorem IV as follows. If a point z is a root of (7), an application 
of Theorem III shows that 

(z- a) k - A(z- b) k --= 0, 

where a and b are some points on or within Ci and C 2 respectively. Hence 

we have 

z- a= A l/k (z- b), 

_a- A llk b 
1 - A llk 

where A llk is some kth root of .4 , so z is on or within one of the k circles of the 
theorem. Variation of a and b independently and over Ci and (7 2 as loci shows 
that the k circles are the actual loci of the roots of (7). Continuous motion of 
the points a\ , a , , a k , bi , 6 2 , , b h so as to remain in their proper regions 
and radius r 2 , and if Ci and Cz are mutually external, then the roots of the equation 
(T) (z ai)(z a 2 ) (z a k ) = (2 61) (2 & 2 ) (2 6*) 

have as their locus the k 1 circles with the respective radii 



\\-ia 
and centers 



1 - CO 

where co takes the k 1 values of the kth roots of unity which differ from unity itself. Any one of 
these k 1 circles which is external to all the others contains precisely one root of (T). 

If Ci and C% are not mutually external, the locus of the roots of (T) is the entire plane. 

The details of the proof of Theorem IVa are similar to those of the proof of Theorem IV 
and are left to the reader. 
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and finally to coincide shows in precisely the manner used in considering the 
roots of (4) that any one of the k circles which is exterior to all the others 
contains precisely one root of (7). 

Theorem IV reduces to the lemma when k = 1 . 

A. few remarks on the geometry of the situation are not out of place. The 
k points (8) all lie on the circle which is the locus of points z such that 

|z a I 



I* -01 

which circle is of course a circle of the coaxial family determined by a and /? 
as null circles. Each of the k points also lies on a circle which is the locus of 
points z such that 

z a 




illk 



that is, the argument of the ratio (z a)/(z /3) is constant. These k 
circles belong to the coaxial family of all circles passing through a and /3 , the 
family conjugate to the former coaxial family mentioned. These same k 
circles can be arranged in order so that at a. and ft each cuts its predecessor at 
an angle of 2ir/k . 

It is quite easy for us to obtain results concerning the roots of the derivatives 
of equation (7). The rath derivative (k &gt; m ^ 0), except for a constant 
factor, can be written 

(9) (z- oi)(z- 02) 0- aUO 

-A(z-b()(z-b 2 ) (z-6U) = 0. 



The points a all lie on or within C\ and the points & all lie on or within Cz , 
by the theorem of Lucas concerning the roots of the derivative of a polynomial. 
Hence all the roots of (9) lie on or within the k m circles with the respective 
radii 



and centers 

a 



\ _ 

where A ll(t ~ m) takes all the k m values possible. These circles form the locus 
of the roots of (9). Any one of these k m circles which is external to the 
remaining k m 1 circles contains precisely one root of (9). This new 
result includes Theorem IV as the case m . 
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We now state another theorem which results from Theorem III and which 
like Theorem IV refers to the sum of two polynomials.* 

THEOREM V. // the points a\ , a% , , a k lie on or within a circle C whose 
center is a. and radius r , all the roots of the equation 

(10) (z- 0^(2-02) (z- o fc ) -A--= 

lie on or within one of the k circles which have the common radius r and whose 
centers are the points a + A llh , where A vt takes all the k possible values. If these 
k circles are mutually external, each contains precisely one root of (10). 

Of course any polynomial equation can be written in the form (10) and in 
an infinite variety of ways. The proof of the theorem follows the proof of 
Theorem IV. If C is the locus of i , a 2 , ,*, the k circles form the locus 
of the roots of (10). 

Equation (10) is particularly interesting because the derived equation is 
independent of A . All the roots of the rath derived equation, m &lt; k , lie on or 
within C . Theorem V can also be extended to a polynomial of the form 

r [(z ai) (z a 2 ) (z a k )] A = 0, 
dz 

but this generalization as well as Theorem V itself is included in a more general 
theorem to be proved from Theorem I. 

III. SOME DIRECT APPLICATIONS OF THEOREM I 

We now proceed to derive a number of results directly from Theorem I 
instead of from Theorem II or Theorem III. 

If in Theorem I the polynomial /(z) contains merely one set of variables 
{af} referring to a single circular region, the locus Z can have as its bound 
ary no point other than a point of the curve traced by z when a = a\ = a z 
= = ctk traces the circle C a . This leads to 

THEOREM VI. Let the common locus of all the roots of a polynomial 

/(z) = (2 i) (z a-i) (z a k ) 

be the interior and boundary of a circle C whose center is a and radius r . Then 
the locus of the roots of the polynomial 

(11) A f(z) + A 1 f ()+... + A^f^-v (z) + A k f&gt; (2) 

is composed of the interiors and boundaries of the circles whose common radius is 
r and whose centers are the roots of 

(12) A (z- a)* + k4i(z- a)*- J + +k(k- 1) ---2 lA k . 

* If in Theorem IV we replace (7) by the sum of two polynomials not of the same degree 
and neither of degree zero, we are led to the determination of a locus which is not generally 
bounded by circles. Compare 4 of the next to the last paragraph of this paper. 
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Any of these circles having no point in common with any other of these circles 
contains a number of roots of (11) equal to the multiplicity of the center of that 
circle as a root of (12). 

A detailed proof of this theorem is quite simple. For any particular root 
z of (11) we consider the points {a^ to coincide at a. on or in C . We have then 

A (z - a) k + kA! (z - a)*- 1 + +&(*-!) 2 L4 fc == 0. 

Since | a a. \ Si r , we know that z must lie in or on one of the circles de 
termined. Conversely, any point z in or on one of these circles is a root of 
(11) for some choice of the {ai}; denote the center of the circle by /3. We 
need merely choose aj. = 0:2 = = ak = a -}- z /? , which is a point on or 
within C . 

If one of the circles C\ has no point in common with any other of those circles, 
we may vary the a.i at will and no root of (11) can enter or leave C\; these roots 
vary continuously if the a,- vary continuously. In particular if 

i = 2 = = oik = a , 

we have the number of roots of (11) proper to Ci . It is similarly true that 
a number of these circles C\, , Ci having no point in common with any 
other of the circles contain a number of roots of (11) equal to the sum of the 
multiplicities of their centers considered as roots of (12). 

Theorem VI enables us to give a result concerning the location of the roots 
of the sum of any number of arbitrary polynomials;/ (z) is to be chosen one 
of those polynomials of highest degree. 

If in Theorem VI we place A = A 2 = A 3 = = A& = 0, AI 5^ 0, 
we have 

A circle which contains all the roots of a polynomial contains also all the roots 
of its derivative. 

This is essentially the theorem of Lucas: 

A convex polygon which contains all the roots of a polynomial contains also all 
the roots of its derivative.* 

The similar results obtained from Theorem VI by placing AI ^ but all 
the other A t zero (I &gt; 0) give the theorem of Lucas for the other derivatives 

of/00. 

Theorem I applies just as well to the integral of a polynomial as to its 
derivative, but for a result of the nature of those just given it is necessary to 
choose a particular integral. Thus we shall prove 

THEOREM VII. Let the common locus of all the roots of the polynomial 

/(z) = (2- i)(z- 2 ) (z - a fc ) 

* In both of these statements as well as many other similar results, the term contains may 
be interpreted to include the possibility of points on the boundary or to exclude that possibility. 
Whichever interpretation is considered in the hypothesis, the conclusion will be true under 
that same interpretation. 
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be the interior and boundary of the circle C whose center is a and radius r . Then 
the locus of the roots of the polynomial 



is composed of the interiors and boundaries of the k + 1 circles whose centers are 
(1 03 ) a and radii 1 1 a&gt; r , where 03 takes all the values of the ( k + l)st roots 
of unity. Any of these circles which is entirely exterior to all the others contains 
precisely one root of F (z) . 

If z is a root of F (z) for particular values of the a, we must have for some 
choice of a. = a.\ = 2 = = ah in or on C , 

(2-5)*" (-a)^ 1 

- 



where O3 fc + - 1 . From this equation we have at once the desired inequality: 

z (1 -- 03 ) a = (1 oj ) ( a a ) , 
\z (1 03)0: =|l oj|r, 

which shows that z is on or within one of the k + 1 circles. On the other hand, 
any point z on or within one of these k + 1 circles is a root of F (z) for proper 
choice of the {;}. In fact, if 03 is the particular root of unity corresponding 
to that circle we have merely to place a == &lt;xi = a 2 = = ak , where 

- ( 1 03 ) a 

a. a = - 

1 -- 03 

the exceptional case o&gt; = 1 is trivial. 

The proof of the theorem is now complete except for the last sentence; this 
is proved in precisely the same manner as the corresponding statement in 
Theorem VI. Like Theorem VI, Theorem VII illustrates the remark made 
just previous to the statement of Theorem VI. 

We shall next prove further results concerning the roots of the derivative 
of a polynomial, making continued use of the Lemma. The special case of 
the Lemma where m\ and m z are positive is particularly simple; this special 
case leads to 

THEOREM VIII. Denote by g (z) the polynomial 

(Z Zi) 7 " 1 (Z Z 2 ) m2 

and by m ( f : m { } ( n = 1 , 2 , , m ) the ratios in which the m distinct roots 
z () of g w (z) (the kth derivative of g(z)) divide the segment (zi, 22) * Let the 
interiors and boundaries of circles Ci and Cz whose centers are ai and a 2 and radii 
r\ and r% be the loci respectively of mi and m% roots of a polynomial f (z) which 

* The reader will easily prove from Rolle s Theorem that no point z (n) distinct from z t and 
02 can be a multiple root of &lt;7 (i) ( z ) . 
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has no other roots. Then the locus of the roots of f (k) (z), the kth derivative of 
f(z),is composed of the m circles (7 (n) whose centers are 



and whose radii are 



m ( 2 n) 



// one of these circles C (n) is exterior to all the others, it contains a number of roots 
//(z ) equal to the multiplicity of z (n) as a root of g (k) (z ) . 

The proof of this theorem is quite similar to the proofs of Theorems VI and 
VII and will therefore be omitted.* 

If a number of circles C (n) of Theorem VIII overlap and are entirely exterior 
to all the other circles C (n) , they contain together a number of roots of f w (z) 
equal to the sum of the multiplicities of the corresponding roots z (n) of g (k) (z ) . 
A similar situation for one as for several circles arises in connection with 
Theorems VI and VII. In the future we shall indicate this general fact by 
saying that the number of roots of f (k} ( z ) proper to a circle C (n) is the multi 
plicity of the point Z M as a root of g (k&gt;&gt; ( 2 ) . 

In Theorem VIII we are in reality considering a polynomial 

/(z) =0(z) *(*), 

where the mi roots of (z) lie on or within Ci and the m 2 roots of ^ (z) lie on 
or within &lt;7 2 Then our conclusion refers to the polynomial 



/&lt;*&gt; (z) = 

When the result is expressed in this form it can be given a large extension. 
Under the same hypothesis with respect to &lt; and \f/ we consider the polynomial 

(13) A &lt;t&gt; $ + Ai4&gt; &gt; ^ (fc - J) + A, 4&gt;" ip- 2) + + A k & $ . 

If z is a root of (13) for a particular choice of the roots of 4&gt; and ^ , there exist 
o; and /3 in or on Ci and C 2 respectively and such that 

(m 2 - k + l)A (z - a) m i (z - $} m *- k 

-- 1) (mz - k + 2)Ai (z - a)" 11 " 1 (z - 

C14) + wi (mi 1) m 2 (m 2 1) 

(z - a) m i- 2 (z - 



+ mi (mi --!) (mi -k + l)A k (z - a} m ^ k (z - /3) m: &lt; = 0. 



* This theorem was published in a short note by the present writer, ParisComptes 
R e n d u s , vol. 172 (1921), pp. 662-664, and its proof as there indicated contains the germ of 
the proof of Theorem I. The special case of Theorem VIII for the case fc = 1 had been 
previously proved by means of Theorem II; see the reference given in connection with that 
theorem. 
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Equation (14) is essentially an equation in homogeneous coordinates /zi, nz 

m z (m 2 - 1 ) (m* - k + 1 ) A ^ ^~ k 

+ TOI m 2 (m 2 -- 1 ) (m 2 - A; + 2 ) AI /x? 1 " 1 M 2 "^ 1 
(15) + mi (mi -- 1) m 2 (m 2 - 1) (m a - k + 3 ) A 2 M? 1 2 M? 2 ~* +2 

+ mi(mi - 1) (mi - & + 1 ) 4 t M ?i- fc /tf* = 0, 

whose distinct solutions, m in number, we denote by jui n) : ju 2 n) , n = 1 , 2, 
, m. From (14) and (15) we see that for some n we have 

z - a _ M ( i re) /ifa + jffiS 





The Lemma immediately gives us certain circles in at least one of which must 
be located the point z . It is easily seen that these circles give the exact locus 
of z , and we have 

THEOREM IX. Let the interiors and boundaries of circles Cj and Cz, whose 
centers are a.\ and 0:2 and radii r\ and r% , be respectively the loci of the mi roots of 
a polynomial &lt;f&gt; and the m% roots of a polynomial \{/ . Then the locus of the roots 
of (13) is composed of the interiors and boundaries of the m circles C M whose 
centers are the points 



and radii 

) f " ) 2 , 9 

(n = 1,2, 



where ^ : /4 n) are the m distinct roots of (15). The number of roots of (13) 
proper to a circle C^ is the multiplicity of the ratio /4 n) : ju 2 n) as a root of (15). 

The methods that we have been using enable us also to obtain a result for 
the roots of the derivative of an entire transcendental function of the simple 

type 

/(z) =&gt;. Q( Z ) 

where 

P ( z ) = a(z 0*1) (z a 2 ) ( z a p ) , a^O, 
Q(z) =b(z -ft) (a -ft) (z-/3 9 ), 6^0. 

The roots of/ (z) are given by the equation 

(16) Q(z)P (z) + Q (z) =0. 

If all the on and /3,- lie in or on a circle C whose center is a. and radius r , and 
if z is a root of (16), we shall have for some a and /3 in or on C 

ap(z - j8)(3 - a)^ 1 + ?(z - i^) 9 " 1 = 0. 



2 _ 7 = co A I - &gt; where co p = 1 . 
ap 
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Then we must have either 

(z -is) 9 " 1 = 
or 

op(z -J8)(z - a)^ 1 + 9 = 0. 

If this latter equation is verified we find from Theorem III that there exists a 
point 7 in or on C such that 

ap(z - j) p + q = 0, 

p l~ 1 

7 = co -\ - w 

\ ap 

It therefore follows that 

z - ( a + co 4^ - ) = 7 -- a , 

V X op ) 

which is less than or equal to r in absolute value. Then all the roots of / (z) 
lie in C and the p circles of common radius r whose centers are the points 

pi q 

a + co +1 
M ap 

We state this fact and its obvious converse in 

THEOREM X. Let the interior and boundary of the circle C whose center is a 
and radius r be the common locus of all the roots of the two polynomials 

P (z} = a(z ai}(z ai} (z a p ) , a ^ 0, 
Q(z) = b(z -/3i)(z -ft) (-&), b^O. 
Then the locus of the roots of the derivative of the function 

*&gt; Q(z} 

is composed of C (unless q = 1 ) and the p circles whose common radius is r and 
whose centers are the points 

P \ q 

Of + CO A I - , 

\ ap 

where co takes the values of the pth roots of unity. There are q -- 1 roots of the 
derivative proper to C and one proper to each of the other p circles. 

In the proof of this theorem we have used not the fact that the interior and 
boundary of C is the locus of the roots of P ( z ) but that it is the locus of the 
roots of P (z) , which gives in reality a more general result. Thus if p 1 , 
there is no restriction whatever on the root of P ( z ) . 

We now prove a theorem which is a generalization of our result concerning 
the roots of (16) corresponding to our former generalization of Theorem VIII. 
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THEOREM XI. Let the interior and boundary of a circle C whose center is a 
and radius is r be the common locus of the roots of the polynomials fa, fa, , &lt;j&gt; q 
of respective degrees ki,k z , ,k q . Then the locus of the roots of the polynomial 

(17) fa- fa- fa ..... &lt;{&gt; q 



is composed of the circle C (unless m = Id for i = 1,2, , g ) and the circles, 
ni -\- nz + + n q in number, whose centers are the points 

a + w[ti (k! -- 1) (h -ni+ 1) fc 2 (k z -- 1) 

(fc -n, + l) k g (k q - 1) 

(k q - n g + 1 ) A ] 1 "i+"+-+*&gt; 

where co takes all possible values of an ( n\ + n 2 + + n q )th root of unity, and 
whose common radius is r . There are 

n, + + n q ) 



roots of (17) proper to C and one root proper to each of the HI + n&lt;&gt; + + n q 
other circles. 

Under the hypothesis of the theorem, if z is a root of (17) there exist points 
a , a" , , a (9) on or within C and such that we have 

(z - a )*i(z - a")* (2 - &lt;&gt;)* - [A-i (A-i -- 1) 

(fci - m + 1) ft (fc&gt; -- 1) (fcs - m + 1) fc, (fc a -- 1 ) 

(ft, - n 9 + 1) A (z - a )*i-i (z - ")*r^2 



Then 2 must satisfy one of the equations 

(z - a ) k i- ni (z - a ")^-2 ... ( z _ a &lt;9))*r- = Q, 

(2 - a )"! (z - a")"* (2 - a&lt;9&gt;) 

- Ai(fti -- 1) (k g -n q + I) A = 0. 

If z satisfies this latter equation we know from Theorem III that there exists 
a point a on or within C and such that 

(z - a)i+ 2 +-+, = k! ( h - 1 ) (k q - n q + 1 ) A . 
That is, we have the equation 

z - [a + [fci (k - 1) (Ar a - TO, + l) &gt; 4] 1 ("-H4+-Hs)} == 5 - a, 

where co" 1 " 1 "" 2 ^"" 1 " 9 = 1 . The absolute value of the right-hand member is not 
greater than r, so z must lie in or on one of the n\ + n 2 + + n q circles of 
the theorem. 

The detailed proof of the remainder of Theorem XI now requires no further 
analytical work and is left to the reader. 

Trans. Am. Math. Soc. 13. 



180 J. L. WALSH 

The present paper aims merely to give some of the more immediate results 
that can be obtained from Theorem I. Further results can be obtained by 
extending in various directions: 

1. Application of these methods to other and more general types of 
polynomials. 

2. Extension of the results to other circular regions, namely, half-planes and 
the exteriors of circles. 

3. Expression of the results in a manner which shall be independent of linear 
transformation of the complex variable. 

4. Detailed determination and study of loci which naturally arise and which 
are of the same general nature as the locus determined in the lemma. For 
example, if the loci of a and /3 are two circular regions, determine the locus of 
the points z defined by 

(z - a) m = A (z - (8) n . 

For only the simplest polynomials /(z, &lt;*;, /3;) do we get a number of 
circular regions as the locus of the roots of / when the loci of the roots of the 
a.i and j3{ are circular regions. All the problems of the present paper have 
been characterized by a certain linearity, either the original problems them 
selves or those problems after a simple transformation. In more general 
problems, where the locus of the roots does not ordinarily consist of circular 
regions, the exact locus should be determined and also a simple even if rough 
approximation. 

5. Application of these results and methods to the case of real polynomials.* 
The present writer hopes to return later to a consideration of these questions. 



* Thus the following theorem results from Theorem VI: 

Let all the roots of a polynomial / ( 2 ) of degree k lie in an interval of the axis of reals whose 
center is a and length I. Denote the roots of the polynomial 

(! ) A Q z k + kA l z k -i + +k(k - 1) -2-1- A k 

by Zi, zt, , Zk , which roots are supposed real. Then all the roots of the polynomial 

(2 ) Aof + A 1 f + + A k fW 

lie in the intervals of the axis of reals whose centers are the points a. + Zi (i = 1,2, , k) 
and whose common length is I . The number of roots of (2 ) proper to each of these intervals is the 
multiplicity of the corresponding Zi as a root of (! ) 
HARVARD UNIVERSITY, 

CAMBRIDGE, MASS. (JUNE, 1921.) 



A NOTE ON THE PRECEDING PAPER * 

BY 

D. R. CURTISS 

The results of Dr. Walsh s paper are so interesting and capable of so many 
applications that it may be worth while to indicate a briefer proof of his funda 
mental theorem which connects with earlier work in this field. 

First let us note that Walsh s Theorem II is a form of Laguerre s Theorem 
given below. In fact, if we write 

/(z) = (z - on) (z - 2 ) (z - a k }, 
Walsh s relation 

+ + 1 k 

z a\ z a-i z a.k z a 

becomes 

f (z) k 

/(z) z a 
or 

(1) o, = z Iff^ 2 

\i) *&gt; K j-, , \ 



Walsh s Theorem II states that if the roots of / ( 2 ) are in the circular region C , 
and if z is exterior to C , then a lies in C . This is precisely the result obtained 
by Laguerre as given on page 59 of volume I of his collected works, if expressed 
in non-homogeneous coordinates as on page 57. 

We shall use another form of Laguerre s Theorem (loc. cit., p. 57) to prove 
the underlying proposition on which Walsh bases his proof of his Theorem I, 
as follows: 

Every circle through any point z and its " derived point" a as defined by (1) 
either passes through all the roots off(z) , or else has at least one root in the region 
interior to it, and at least one root in the exterior region. 

I have recently called attentionf to the fact that this is a corollary of a 
theorem of Bocher s on jacobians which has served as a starting point in 
Walsh s earlier papers. 

With a slight change in Walsh s notation, whereby we substitute z s for ex s, 
the proposition from which Theorem I is deduced may be stated thus: 



* Presented to the Society April 14, 1922, under the title On the zeros of successive polars of a 
binary form. 

t S c i e n c e , vol. 55 (1922), p. 193. 
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Letzi,z 2 , -" ,z k lie within a circular region C , and let X* denote the sum of 
all products of the z s taken r at a time. Then the equation 

(2) z k + k ai z*- 1 + k ~ L -^ 02 z k ~ 2 + + ka^ z + a k = , 

where 

- a k = a X* + i Z*-i + 2 2fc-2 + + fl ft-i Zi 



at least one root in C for each set of values a&lt;&gt; , a\ , , au-\ 
This theorem is trivial if all the o s are zero. In any other case we proceed 

by induction, assuming our proposition true for k = r 1 and proving its 

validity for k = r . It is obviously true for k = 1 . 

Let us place k = r in (2) and in (1). The resulting left-hand member of (2) 

we will use as/(z) in (1). We write 

/ ( z ) = z r + rai z r ~ l -f- + a r , 

so that (1) becomes 

,Q a = _ _oi z 1 - 1 + ( r - 1 ) a 2 z 1 ^ 2 + + a r 



1 + ( r 1 ) O! z 1 " 2 + + a r _ 



Now consider the equation obtained by replacing a. in (3) by z r ; cleared of 
fractions and arranged according to powers of z this is 



(4) (a z r + a l )z r - 1 + (r - 1 ) (a^z r + a 2 )z ~ 
But if we compare this with (2), and note that 



a 



\""^ r 1 / V &gt; r 1 

= - a r -T.Z r + C Zr Z^r-1 + 1 (Z r 2^r-2 



= Z r + 01 r- + 1 2r + 2 r-2 

+ +( a r _ 2 z, + o r _i ) Zi" 1 

we see that (4) is a case of (2) for k = r 1, and hence (4) has a root Z r in C . 
We now give z the value Z T in (3) and obtain the result that the "derived 
point" a is z r . By Laguerre s Theorem every circle through Z r and z r either 
has on it a root of /(z) or has a root in its interior and one exterior to it; but 
Z r and z r are in C , so that we can draw through them a circle interior to C . 
Thus/(z) has a root in C, and on completing our induction we see that (2) 
has a root in C . 
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The theorem we have just proved can not, however, be regarded as a new 
one. It is equivalent to the theorem of Grace:* 

If a form f ( z ) is apolar to a given form &lt;/&gt; ( z ) , then it has a zero within any 
circle enclosing all the roots of &lt; (z ) . 
If equation (2) and the equation 

(5) (z -ai)( -22) ---.(a -z t ) = 

are made homogeneous, the condition that (2) and (5) be apolar is precisely 
the relation just following (2). Since this holds, Grace s theorem requires 
that a root of (2) lie within any circle enclosing all the roots of (5), which are 

Zl , Z% , , Zk 

Grace s theorem is a consequence of Laguerre s when the latter is put in 
homogeneous form. If in (1) we replace a. by /??, z by x/y , and/ (2) by 
y k f( x &gt; y), (1) takes the form 

(6) #* + i?/; = 0, 

which is the equation of the first polar of ( , r\ ) with respect tof(x,y). One 
form of Laguerre s theorem (loc. cit., p. 59) may, then, be thus stated: 

// the point a = /rj is without a circle C containing all the roots of f, all the 
roots of the first polar of a icith respect to f are within C . 

Consider a set of points Zi,z z , ,Zk, and let /i be the first polar of Zi with 
respect to/, let/ 2 be the first polar of z 2 with respect to/i, etc. If all these 
z s are outside C , the roots of all the polynomials /i , / 2 , , fk-i are within C 
If (2) and (5) are apolar, z k is the root of // ; _i , and Grace s theorem, in which 
/ and &lt;f&gt; may be interchanged, follows directly. 

The "mixed polars " fi , / 2 , are perhaps worth writing down in another 
form. Thus if we use the notation, for r &lt; k , 

\1/(Z) = (Z - Zi) (Z - 2 2 ) (2 - Zr) = CO Z r + C i Z^ 1 + + C r , 

we have 



In non-homogeneous notation the equation f r = becomes 

* The zeros of a polynomial, Proceedings of the Cambridge Philo 
sophical Society, vol. 11 (1900-02), p. 352. See also Szego s paper, Bemerkungen 
zu einem Satz von J. H. Grace iiber die Wurzeln algebraischtr Gleichungen, Mathematische 
Zeitschrift, vol. 13 (1922), p. 28, in which the "Faltungssatz " is, in a sense, the com 
plement of the theorem proved above. 
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,,,. i } dtd-if (k- r 
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rl dz r 



If every root offis within a circle C , and every root of ^ is without C , then every 
root of (7) is within C . 

NORTHWE STERN UNIVERSITY, 
EVANSTON, ILL. 



DETERMINATION OF ALL GENERAL HOMOGENEOUS POLYNOMIALS 

EXPRESSIBLE AS DETERMINANTS WHOSE ELEMENTS 

ARE HOMOGENEOUS POLYNOMIALS* 

BY 

H. S. EVERETT 

1. Introduction. Dicksonf has shown that every binary and every ternary 
form, every quaternary quadratic, and a sufficiently general quaternary cubic 
form can be expressed as a determinant with linear elements, and that this 
property does not hold for any other form which is the general one of its degree 
and number of variables. The present paper is a generalization from the case 
of determinants with linear elements to that of determinants with elements of 
higher degrees. 

In the first part it is proved that if the number of variables is greater than 
three, no general form of degree rs is expressible determinantally, that is, as a 
determinant of order r with homogeneous elements of degree s , s ^ 2 . The 
case of quadratic elements is considered in 2. The second part treats the 
determinantal expression of binary and ternary forms. In 5 binary forms 
are expressed rationally as determinants. In 6 a general procedure is set 
up for ternary forms and the remaining sections show that this procedure is 
successful for ternary quartic and sextic forms. 

I. FORMS WITH n &gt; 3 ARE NOT EXPRESSIBLE DETERMINANTALLY 

2. THEOREM 1. When the number of terms in the general form of degree 2r 
in n variables ( n = 2 ) exceeds [ f n ( re + 1 ) 2 ] r 2 + 2 , the form is not ex 
pressible as a determinant whose elements are quadratic forms. 

Let D be any r-rowed determinant with quadratic elements homogeneous 
in the n variables x\ , , x n , which represents the general form of degree 2r . 
Its matrix M may be written in the form 

M = X\ Mil + Xj. X 2 M U + + Xi X n M ln + X 2 2 If 22 + + M nn , 

where each My is a matrix whose r 2 elements are constants. We proceed to 
simplify the matrix M by the method of matrix multiplication employed by 
Dickson.J 



* Presented to the Society, December 28, 1921. 

t These Transactions, vol. 22 (1921), p. 167, hereinafter referred to as D. 

J D, p. 168. 
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Since D represents the general form, x\ r is present, whence the determinant 
of MH is not zero. Thus M n has an inverse Mn such that M n Mn is the 
identity matrix 7 . The new matrix 



N =-- MMn 1 = x\ I + Xl x, Nu + + x* N nn (Nu =-- Ma J/n 1 ) 

has as its determinant D/ \ Mn and has unity as the coefficient of x\ r . Next 
we choose a matrix B such that E~ l N iz B shall have a canonical form P J2 . 
Then our matrix reduces to 

P == B- 1 NB = x\ I + xi .TO Pi2 + + x 2 n P nn (Pa =- 5- 1 N fj B). 

In contrast with the case of linear elements, we cannot show in our case of 
quadratic elements that the roots Xi , , X r of the characteristic equation of 
NIZ are all distinct. Hence we permit PIZ to have the most general form* 
with t distinct X s, t ^= r . Further normalization of P is effected by the matrix 
K commutative with P J2 as defined by Dickson.f In K~~ l Pi 3 K we can 
specialize r 1 elements. 

In case t = r no further specialization is possible. We have then, if we 
count the factor removed from D and the r distinct X s of P\z , 

1 + r+ [kn(n+ 1 ) - 2] r 2 - (r -- 1 ) 

or [ \n ( n + 1 ) 2 ] r 2 + 2 parameters in the modified matrix of D . In case 
t &lt; r the true number of parameters is less than that given above because we 
then have t &lt; r distinct X s, and because further normalization is possible 
since the matrix K is not the most general matrix commutative with P i2 . 
Therefore, the number of parameters given above is the true maximum and our 
theorem follows. 

3. THEOREM 2. When the number of terms in the general form of degree rs 
in n variables ( n =^ 2 ) exceeds 

... - | ^ 



it is not expressible as a determinant ichose elements are s-ic forms. 

The argument in 2 applies when D is a determinant of order r whose 
elements are forms of degree s ^ 2 in n variables, except as to the number of 
terms in the matrix M . This number is [ n ( n + 1 ) ( n + s 1 ) ]/*! . 

4. THEOREM 3. If% n &gt; 3, s s 2, the general n-ary rs-ic form cannot 
be expressed as a determinant of order r ivhose elements are n-ary s-ic forms. 

The number of terms in the general form of degree rs in n variables is 

Srs+n~ 1\ (rs+ 1) (rs + n -- 1) 
\ rs ) (n-1)! 



* D, p. 178; Dickson, Linear Groups, 1901, p. 223; BScher, Higher Algebra, p. 293 

t D, p. 179. 

J For s = 1 see D, Theorem II, p. 171. 
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Then by 3 the general form of degree rs in n variables (n ^ 2) cannot be 
expressed as a determinant with s-ic elements if 

(1) Q+ 1) (r5+ n -- 1) &gt; s 

(n- 1)1 

For n = 2 and n 3 , respectively, (1) becomes 0&gt; (r 1 ) [ (5 -- 1 ) r 1] 
and 0&gt; (r l)[(3s 2)r 2], which hold for no value of r when 5 = 2. 
Now (1) holds if 

(2) (r5+l)(r5+2)(25+3) ... ( 2s + n - 1) 



(n- 1)! 

holds, where the left member of (2) has been obtained from that of (1) by 
replacing r by 2 everywhere except in the first two factors. We consider this 
relation first f or s = 2 . If n ^ 7 , r = 2 , we can cancel from the numerator 
and denominator of the left member the factors 7 , , ( n 1 ) (which are 
absent if n = 7 ) , and then (2) holds if 



6! 



since the right member exceeds the value [ \n ( n 4- 1 ) 2 ] r 2 + 2 of S for 
5 = 2. The last inequality holds if 

n(n-\- 1 ) ( n 4- 2 ) ( n + 3 )/180 = f n ( n 4- 1) 2; 

that is, if n (n 7) ( n + 1 ) (n + 12) 4- 360 s 0, which is obviously true 
for n ^ 7 . For n = 4,?i=5,?i=6, respectively, (2) becomes 

(2r -- l)(2r - 2) (2r - 3) &gt; 0, (r - I)(4r 3 + 24^ - 19r + 6) &gt; 0, 
(r- I)(8r 4 4- 6Sr 3 + 238r* - 107r + 30) &gt; 0, 

each of which evidently holds for r S 2 . So (2) holds for 5 = 2 , r ^ 2 , n S 4 . 
We proceed now to show by induction on 5 that (2) and hence (1) holds for 
5 = 2,rS2,ni4. We assume that (2) holds for r = 2 , n = 4 , and any 
given value of 5. We wish to show that (2) implies the corresponding in 
equality with 5 replaced by s 4- 1 , viz., 

[(5+ l)r+ !][(*+ l)r + 2](2a+5) (2s + n + 1) 

(3) 

&gt; rn(n+l ) ...(n+5)_ 2M+2i 



\n 
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To derive (3) from (2) we must add to the right member of (2) 

rn(re+l) (ro + g) _ n(n+l) (n + s - 1) 1 ^ 
(*+!)! si 

(n- l)n(n+ 1) (n+5- 1 ) ^ 

(*+!)! 
and to the left member 

(25 + 5) (2*+ n 1) , , . 

-- -r 1 - - ( at 2 + 6r + c ) , 

(n- 1)! 

where 

a = 4(n -- 1)5 3 + (n 2 + 9n -- 4)5 2 + 2(n 2 + 3w + 1)5 + w(w + 1), 
6 == 12 (n- 2)s 2 + 3(n 2 + 5n - 10)5+ 3n(n + 1), 
c= 8(n- 3)5 + 2(n 2 +n-- 12). 

To establish the induction we need 

(25+5) (2 + n-- l)(ot a +6r+c)/(n- 1) 

S (*+2) (* + n- 1)^. 

Since the s-factors on the left are each greater than the corresponding 5-factors 
on the right, and since br + c &gt; for n ^ 3 , (4) is implied by 

(5) o/(n--l)S(*+2)(+3)(*+4). 

Now (5) holds for n = 4 if it holds for n = 4 since the coefficients of s are all 
positive for these values of n . But for n = 4, (5) becomes 

9s 3 + 21s 2 - 205- 52 &gt; 0, 

which obviously holds for 5 = 2. Our induction is therefore complete and 
Theorem 3 follows. 

II. BINARY AND TERNARY FORMS EXPRESSED DETERMINANTALLY 

5. Binary forms. Since the binary form of degree rs can be factored into 
rs linear factors, it is expressible as an r-rowed determinant each of whose 
diagonal elements is a product of 5 linear factors, and whose elements outside 
the main diagonal are zero. Dickson has expressed the binary form rationally 
as a determinant with linear elements.* Every binary form of degree rs can 
be expressed rationally as a determinant of order r with 5-ic elements : 

*D, p. 175. This determinant can be reduced by elementary transformations to the 
present one. 
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6. Ternary quartic form. The general ternary quartic may be given the 
form 

where a and b , c and d, e and/, are three pairs of bitangents of a Steiner set.* 
This function equals the determinant 

(ab cd ef) 2cd 

2ef (ab-cd-ef) 

The determination of the bitangents depends, however, upon the solution of 
equations of high order. In the case of the general ternary quartic form with 
no repeated factor Dickson s determinant f of order r 4 with linear elements 
can be reduced to a determinant of order r = 2 with elements of degree s = 2 
by the following elementary transformations. Multiply the third row and 
divide the fourth column by X + c 44 z; multiply the second column and 
divide the first row by Xi + en z; multiply the first column by z and add 
to the second column; multiply the fourth row by z and add to the third 
row; multiply the second row and divide the second column by z. The 
resulting determinant is 

772 3, 

l 2 ^21 & ^* 

. ( c 32 h c 42 z ) z ( c s i k c 41 z ) Z 3 / 4 c 43 z 2 

where Z; = Xi -\- cu z . No elementary transformations have been found 
which accomplish a similar reduction of Dickson s determinant of order rs &gt; 4 . 

7. General ternary form. The remainder of this paper is concerned with 
a method of expressing a sufficiently general ternary form / of degree rs as a 
determinant of order r whose elements are s-ic forms. We may assume that 
/ is irreducible. For, if / = /i/ 2 where f\ and f% are of degree r\ s and r 2 s 
respectively and expressible as determinants of order r\ and r 2 whose matrices 
are M\ and M 2 , then / equals the determinant of the matrix 

fi 
M, 



,) 



* Miller, Blichfeldt, and Dickson, Finite Groups, 1916, p. 355. 
tD,p. 174. 
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where is a matrix all of whose elements are zero. With Dickson* we choose 
a triangle of reference such that, f or z = , / ( x , y , ) = XL X X rs , where 
Xi = x + X y and AI , , X rg are all distinct. Then 



X rs 



E 
1=1 



where F k is a binary form of degree rs k . We shall attempt to express / 
as a determinant of the following special type: 

X\ X s + Z0U 2012 

021 Xs+l X 2s + 2022 

20rl Z0r2 X 

where each 4&gt;n is a ternary form of degree s 1 . There are ^ (rs -^ 1 ) ( r* + 2 ) 
coefficients in/. In identifying /(.T, t/, 0) with A"i, , X TS , re + 1 of these 
coefficients have been fixed. So the identification of D with / involves 
%rs ( rs + 1 ) conditions on as many coefficients of the 0;/ . We therefore 
assign simple values to the remaining ^rs (r 1) coefficients of the 0; ; -; in 
each 0,-y (j &gt; i) we set the first s coefficients equal to zero with the exception 
of the first coefficient of each 0,-; ( j = i + 1 ) which we set equal to one. 

We then expand D axially and consider the identification of the terms of 
this expansion, arranged in ascending powers of z, with the corresponding 
terms of any given form /. First, the terms linear in z will be identical with 
zFi (y , x} of/, where FI is any given binary form of order rs -- 1 , if they are 
equal for the rs values x = A; y (i I , ,). The resulting rs con 
ditions involve the coefficients of the 0;y of D . As a matter of fact, it will be 
seen in the following sections that they uniquely determine rs coefficients of 
the &lt;t&gt;a . Next, the terms quadratic in z will be identical with z 2 F* ( y , x) of/, 
where F 2 is of order rs 2, if they are equal forf x --\iy (i = 1, 
, rs 1 ) . The resulting rs 1 equations involve coefficients of D . 
Proceeding in this manner to identify terms in z k ( k = 1 , , rs ) we obtain 
a system of rs + (rs 1 ) + - + 2 + 1 = %rs (rs + 1 ) equations in as 
many unknown coefficients of D . Since these equations involve arbitrary 
parameters FI, , F rs , and since by the implicit function theoremj solutions 
exist in the neighborhood of a point for which the jacobian of the equations 
does not vanish, the method outlined above effects the identification of D 
with/ if the jacobian of these equations is not identically zero. 

* D, p. 172. 

t Or for any other choice of rs 1 of the X s. 

t Bliss, Princeton Colloquium Lectures, 1913, p. 7. Attention is called in this connection to 
MacMillan s method of solution involving the introduction of a parameter and the expression 
of all solutions in power series of this parameter. See MacMillan, Mathematische 
A n n a 1 e n , vol. 72, p. 180. 
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8. Ternary quartic form. A sufficiently general irreducible ternary quartic 



with distinct X s can be expressed as the determinant 



D = 



A^ 2 + ( h n x + k u y + lu z ) z ( x + /i 2 2 ) z 

The ten equations which arise in the identification of D with / follow. For 
brevity we write the frequently recurring expression X u X,, as (uv) , and 
k uv Xu, h uv as ( uv w ) . 

{(30(40(110 = F,(l, -Xi) (i= 1,2), 

1 (lj)(2j)(22j) = Fi(l, -Xj) = 3,4); 

j(30(40/n + (lH)(22i)+X,-(21i) F&gt;(1, -Xi) (i = 1,2), 
1 (13)(23)Z 22 + (113)(223) + X 8 (213) = F 2 (l, -X 3 ); 

(110/22 + ( 22O/H - (2H)/i2 + X,-/2i == F 3 (l, - Xi) 

/ll /22 /12 /21 = -T 4 

The first four equations are linear and determine uniquely ku , 
since their determinant is 

M 4 = miTOi = (31 ) 2 (41 ) 2 (32 ) 2 (42 ) 2 (21) (43), 



=-- 1,2); 



"22 , 



where 
mi=(31)(41)(32)(42) 



1 - Xi 

1 -X 2 



m 2 =(13)(23)(14)(24) 



-X, 

-X 4 



and is different from zero in view of the fact that the X s are distinct. Since 
this system falls naturally into groups of 4 , 3 , 2 , and 1 equations each, the 
Laplacian expansion of the jacobian of the system by minors of the last row, 
of the eighth and ninth rows, of the fifth, sixth, and seventh rows, and of the 
first four rows yields most readily the complete coefficient of the various terms. 
If the jacobian has its columns in the order ku &gt; hu &gt; 22 , ^22 , kzi , ^21 , ^22 &gt; In , 
lii , /i2 , we find on the main diagonal the only term of the expansion actually 
involving Z 2 i ^22 Its coefficient is 

MtMsMzM! = (21) 3 (31) 3 (32) 3 (41) 2 (42) 2 (43)XiX 2 , 
where 

-X 2 

M a = 



A2 A2 




X 3 -XI (13) (23) 

We may assume that Xi X 2 ^ , since if one of the four distinct X s is zero it 
may be taken initially to be X4 . Then since the X s are distinct, the coefficient 
of /2i &22 is different from zero and the jacobian does not vanish identically. 
9. Ternary sextic form, s = 3 . A sufficiently general irreducible ternary 
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sextic 

(6) 



i = x-\- 



with X s distinct can be expressed as the determinant D with cubic elements 

X 3 + zcf&gt;n z&lt;f&gt;n 

Z021 Xi X$ X& + Z0 

where &lt;;y = a# x 2 + b^ xy + a, y 2 + da xz + e^ yz + /y z 2 , and where in 
particular 012 = 1 , &i2 = Ci 2 = . The identification of Z&gt; with / involves 
the following 21 equations. We use the abbreviated notations ( uv ) = X u X , 

( WDW ) = e uv X w rf ul) , and [ uvw ] = a m X^, 6 U1) X, + C OT . 

f (4i)(5i)(6i)[llt] = ^(1,-XO (t 1,2,3), 

(j = 4,5,6); 

F,(1,-XO (i = 1,2,3), 
^ 2 (l,-X y ) (j = 4,5); 



= 1,2,3); 
(i = 1,2); 



(4)(5t)(6t)/ii+[ll*](22t) 

-X;(21*)-[21i](12i) = F 3 (l, -XO (t== 1,2,3), 
(14) (24) (34 )/ 22 + [114] (224) + [224] (114) 

-M(214)- [214](124) - F 3 (l, - X 4 ); 

+ [22i]/ u + (Hi) (22i) - X?/2i - [21t]/ 12 

- (12i)(21i) = F t (l, -X t -) 

+ (22i)/u (12i)/ 21 - (21t)/i 2 = F 5 (l, - XO 

/11/22 ~ /12/21 = F 6 . 

With the columns of the jacobian arranged in the order an , 6 U , GU , a 22 , 622 &gt; 

C22 , 21 , ^21 , C 2 i , 0*22 , 622 , C?21 , Czi , ^11 , fill , /ll , /22 , /21 , ^12 , ^12 , /12 , W6 find On 

the main diagonal the complete term of the expansion involving /| t bn c\ 2 
Its coefficient is 

M 6 M 5 M^ M 3 M 2 Mi = X? Xi Xf ( 12 ) 5 ( 13 ) 4 ( 14 ) 4 ( 15 ) 3 ( 16 ) 2 ( 23 ) 4 

X (24) 4 (25) 3 (26) 2 (34) 4 (35) 3 (36) 2 (45) 2 (46)(65), 
where 





V _, 1 -*1 


-A? 


Mi = - - 1 , M 2 




i j- 

Xo - 1 


&gt; M 3 = 1 X 2 


-Ai 




1 -As 


AS 




Al Xf Xi 
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_ \4 \3 \2 




If 4 = 


X2 X2 A 2 

AS AS AS 


1 
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x i A i A i 

A2 A2 A2 
% 4 N 3 -v 2 


I 




Al -XI - A 4 
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A 3 A 3 AS 






X? - Ai 1 




Xi Xi 1 




MS = 


X| - A 2 1 




Af. -A 5 i 






X! -X 3 1 




X 2 6 -Xe 1 
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We may assume that Xi X 2 As 7^ , since if one of the six distinct X s is zero, it 
may be taken initially to be Xe . Then since the X s are distinct this coefficient 
is different from zero and the jacobian does not vanish identically. 

10. Ternary sextic form, s 2 . A sufficiently general ternary sextic (6) 
can also be expressed as the determinant D with quadratic elements 



D = 



2023 
+ 2033 



where 0;y = h^ x + kfj y + l^ z and Ai 2 = ^23 = 1 , h 13 = fti 2 = ftis = k^s = . 
Let A be the determinant |Z,-y| of third order, and let LH be the minor of la 
in this determinant. In the following 21 equations which arise in the identi 
fication* of D with (6) we use the abbreviations 

(uv) = \ u X v , ( umv ) = k uv \w h uv : 



^(l, -Xy) (j = 3,4), 

(l*)(2fc)(3fc)(4Jfc)(33Jfc) = ^(1, - Xt) (fe = 5,6); 



50(6i )/n+ (30(40(110(330 
+ (50(60(110(220+ (50(60 X t - (210 = *2 ( 1 , -X,-) (i = 1,2), 
(13) (23) (53) (63) / 22 + (13) (23) (223) (333) 

+ (13)(23)X 8 (323) + (53) (63) (113) (223) 



(Ifc) (2Jfc) (3&) (4A;)? 33 + (U) (2ft) (22ft) (33ft) 

+ (1ft) (2ft)X fc (32ft) + (3ft) (4ft) (lift) (33ft) 

= *(!, --X*) (ft = 5, 6)5 

(30 (40 [(HO ^33- (310^13+ 



+ (110 (220 (330 + (HO (320 X, 

+ (2lO(33OXi+ (3H)X?= F 3 (l, - X,-) (i = 1,2), 
(1ft) (2ft) [ (22ft) Z 88 - (32ft)/ 23 + (33ft)/ 22 + X fc / 32 ] 

+ (3ft)(4ft)[(llft)Z 33 - (31ft)/i 3 + (33ft)/n] 

+ (lift) (22ft) (33ft) + (lift) (32ft)X* 

+ (21ft)(33ft)X fc + (31ft)Xl= F 3 (l,-\ k ) (ft = 5,6); 



* See second footnote, 7. Here it was found advantageous to select the X s in the order 
Xi , X2 , X , Xs , Xs , X* , since this selection gives the minimum number of terms in certain equa 
tions. Thus, the tenth and eleventh equations contain fewer terms than does the ninth 
equation. 



194 



H. S. EVERETT 



(32i) J n + X,- 



. Similarly for i 6 with first term replaced by ( 16 ) ( 26 ) in; 

\ 1 -i-t j -L-/11 ( a Li j -L/21 "T" V ol i j JL/31 ~T~ Ai JL/12 ~T~ ( l ) -/-^ 2 2 

/ net \T l\7~ l/-OO \r F/ l \\ 

- v o/t ) -L 32 ~r A; -L23 ~r ( 061 ) -L 33 = r 5 (^ 1 , A 4 - ; 

AK 1 
" 6 



i = 1,2); 



With the columns of the jacobian arranged in the order ku, hn, & 22 , A 22 , 

^33 , hs3, kzi, llll\, /22, ^32, ^32, &31 , ^31 &gt; ^33, ^32 &gt; hi, ^21 j ^31 &gt; ^12, ^23, ^13 WC find 

on the main diagonal the complete term of the expansion involving 
lli l%2 hn k% 2 &33 Its coefficient is 



M 6 M 6 
where 



M 3 



z M l = \l X| X 5 X 6 ( 12 ) 5 ( 13 ) 3 ( 14 ) 2 ( 15 ) 5 ( 16 ) 6 

X (23) 3 (24) 2 (25) 5 (26) 6 (34)(35) 3 (36) 3 (45) 2 (46) 2 (56) 3 , 



x? 

|x! 





1 Xj 




1 X! X 2 


2 = 


1 X 2 


M 3 


1 X 6 X 2 


-x? 










-xi 










-Xi (15) (25) 


(15)(25)X 6 





-XI (16)(26) 


(16)(26)X 6 





= (13)(23)(53)(63)(51) 2 (61) 2 (52) 2 (62) 2 X 1 X 2 X 5 X 6 

1 -X: 



X 



-X 2 



= (31) 2 (41) 2 (51) 2 (61) 2 (32) 2 (42) 2 (52) 2 (62) 2 (53) 2 



X (63) 2 (54) 2 (64) 2 



~Xo 



-X 3 
-X 4 



-X 5 
-X 6 

-x, 

-x 6 



We may assume that Xi X 2 X 5 X 6 5^ since if one of the six distinct X s is zero 
it may be taken initially to be X 4 . Then since the X s are distinct the coefficient 
of this term is different from zero and the jacobian does not vanish identically. 
No attempt has been made to prove that for ternary forms higher than the 
sextic the corresponding jacobian is not identically zero. 
BUCKNELL UNIVERSITY, 
LEWISBUEG, PA. 



GENERAL VECTOR CALCULUS * 

BY 

JAMES BYRNIE SHAW 

INTRODUCTION 

This paper presents results from various papers read before the Society 
during a period of several years. These are indicated in the footnotes. The 
calculus is independent of the number of dimensions of the space in which the 
vectors are supposed to be placed. Indeed the vectors are for the most part 
supposed to be imbedded in a space of an infinity of dimensions, this infinity 
being denumerable sometimes but more often non-denumerable. In the sense 
in which the term is used every vector is an infinite vector as regards its dimen 
sionality. The reader may always make the development concrete by thinking 
of a vector as a function of one or more variables, usually one, and involving a 
parameter whose values determine the " dimensionality " of the space. The 
values the parameter can assume constitute its " spectrum." It must be 
emphasized however that no one concrete representation is all that is meant, 
for the vector is in reality an abstract entity given by its definition, that is to 
say, postulationally. The case is analogous to that of " group " in which the 
"operators" are generally not operators at all, since they have nothing to 
operate upon, but are abstract entities, defined by postulates. Always to 
interpret vectors as directed line-segments or as expansions of functions is to 
limit the generality of the subject to no purpose, and actually to interfere with 
some of the processes. It is sufficient to notice that in any case the theorems 
may be tested out in any concrete representation. The author desires to 
state that he discriminates between the terms function, values of a function, 
expansion of a function. The function is the law which enables one to ascertain 
the values. The table of values is not the function. The expansion is the 
system of coefficients of the basis functions by means of which we may have a 
particular representation of the function. Further a system of coefficients 
does not constitute a vector; such a system the author calls a multiplex. 

It is true that a multiplex appeals to many mathematicians as a sufficient 
entity to serve as vector, but a multiplex does not specify a vector till the basis 
(or in geometry the set of axes) is specified. A system of coefficients does not de 
fine the same function when attached to powers of x as when attached to sines in 
a Fourier series; the vector (1,2,3) has no meaning till the directions for the 

* Presented to the Society at various meetings, 1909-1921. 

Trans. Am. Math. Soc. 14. 
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numbers are assigned; and 1 -- .r 2 is the same function whether expanded in a 
Fourier series or a Bessel series, though its multiplex is different. The essence 
indeed of a vector calculus lies in its ability to produce invariantive and co- 
variantive expressions, and it does this because it uses vectors directly and not 
some particular representation of the vectors. A calculus which produces 
formulas in x , y , z , even though they may be at once replaced by x , y , z , 
and preserve the same formulas, may be an invariant calculus, but it is not a 
vector calculus. At the best, symbols which stand for such expressions referred 
to axes for their meaning are merely short-hand symbols. 

The method developed here has been found by years of use to be flexible, 
smooth-running, simple, and to include all the systems of vectors now extant. 
Its expressions are all covariantive for the simple reason they have no depend 
ence upon any system of reference. They are compact and readily interpreted 
in such applications as differential geometry. It furnishes the basis for integral 
equations, and the general theory of distributive operators. In the abstract 
the intention is to create a calculus of symbols which are defined by the rules of 
the calculus. Any entities that submit to these rules may be represented by 
the symbols so far as this calculus is concerned, though from other points of 
view they must be treated differently. These symbols stand for entities which 
are subject to certain processes. A process is not a function; it produces 
entities which are functions of the entities subject to the process, but the process 
is not the function. For instance we use a process called addition, and one 
called multiplication. There are several called "A" processes which are not 
"multiplications." The author is aware that some mathematicians like to 
call some of these A processes by such names as inner multiplication, outer 
multiplication, indeterminate multiplication, and the like, but he has not yet 
observed that this multiplicity of names has been any advantage in the develop 
ment of the subject. 

On the side of vectors in finite space (as to dimensions) this method includes 
the Clifford algebras,* and might be supposed to be identical with the general 
treatment of Clifford. The question has been asked : Why not assume a system 
of units, ii,i z , , i n , whose squares are -- 1 , and such that i p i q = - i q i p 
when p j^ g? All the formulas in the finite cases may be worked out from this 
point of departure. They might then be generalized to the case of a denumer- 
able infinity of units. This would indeed at least simplify much of the extant 
work on the uses of orthogonal functions, vectors in a "Hilbert space," etc. 
But when we take up the most general case of a non-denumerable space it 
would fail. The present development has no reference to any dimensions. 

* Consult Clifford s Works; Joly, Proceedings of the Royal Irish Acad 
emy (3), vol. 5 (1897), pp. 73-123; vol. 6 (1900), pp. 13-18; M Aulay, Proceedings 
of the Royal Society of Edinburgh, vol. 28 (1908), pp. 503-585. 
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This must be well understood by the reader, for it is fundamental. The 
Grassmann treatments are hampered by their assumption that the "outer 
product" of all the basis units of the system is a numerical value. Without 
knowing the number of dimensions of the space the formulas cannot always be 
written. The formulas here are equally valid whether the vectors lie in a space 
of three, three thousand, a denumerable number, or a non-denumerable 
number of dimensions. There is no basis, for every vector is an entity unique 
in itself. It defines with its numerical multiples a linear space, any two define 
with their numerical multiples and under addition a binary space, any n an 
w-ary space, under certain restrictions to be noted. 

I. FUNDAMENTALS 

1. DEFINITIONS 

1. Vectors. Vectors are represented in notation throughout by small Greek 
letters, or by expressions built up with small Greek letters, and other symbols. 
Capital Greek letters invariably mean operators. Roman (italic) letters mean 
numerical values selected from a given domain, or field. An exception some 
times is furnished in using ir in the ordinary sense. 

Vectors are defined by stating the conditions to which they are subject. 
These are as follows. 

(1) Each vector gives rise to a set of vectors, called its multiples, repre 
sented by writing the symbol of the vector and a symbol for a number 
(or mark) chosen from the given field, thus from a we have aa , 3a, ira , and 
the like. No distinction is made between aa and aa. The multiples of a 
vector are as numerous as the marks or numbers of the field. Any one of the 
multiples may itself be taken as the initial vector and the others, including the 
original vector, are then multiples of it. No vector is a unit vector absolutely, 
but only relatively. 

(2) In case we have one more vector which is not among the multiples of the 
vector a , say /3 , then the multiples of a and the multiples of /3 enable us to 
form a binary complex of vectors, indicated by xa + y$ , where x and y inde 
pendently assume all values in the field. The vector = xa. + yfi is a unique 
vector, but is said to be dependent upon a and j3 , or not to be linearly inde 
pendent of them. We call the process of forming these vectors of the complex 
addition of vectors. We assume as a matter of definition that 

= xa + y(3 = yj3 + xa , 

and further that, in the case of still a third vector, not in the complex of a., 
and (8 , say 7 , if = xa + yfi , f = y(i + 27 , a = + zj , r = xa + f , then 
in all cases a = T . 

This is called associativity of addition. 
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(3) If == X x n fx n , then the multiples of , as c, are also to be formed 
from the same multiples of x\ a\ , x z &lt;*i , &gt; that is 



We see that by this last condition 

xa + ya + = ( x + y -\- ) 

(4) If is included in the field, then we have vectors Oa , 0/3 , and these 
are indistinguishable. 

It follows that we may consider a + = a in all cases. Further we see 
that the multiples of a, /3 , 7, etc., which determine a complex are to be con 
sidered as in the complex. Hence the vectors which determine a complex are 
not unique. Whatever vectors are used to determine a complex are called a 
basis for the complex in case they constitute an irreducible set, in the sense that 
no one of them is included in the complex of the others. That is the same as 
saying that ai , a* , , a n form a basis for a complex in case 

Op = .TI i + .r 2 2 + + ? an 

only for x\ x z = = x n = . They are said then to be linearly in 
dependent. 

(5) When the number of linearly independent vectors is not finite we 
indicate them by using a parameter attached to the vector symbol, sometimes 
as a subscript, as in the case of a denumerable infinity, thus &lt;XL, a, , a x ; 
and sometimes as a parameter in a parenthesis, as a(i), as in the case of a 
non-denumerable infinity; The complex in the first case is indicated by 

= Z!r - T or S x n a(n). 
The complex in the second case is indicated by 



Xi 
x(i)a(i)di. 



As an instance of each case, so far as our conditions go we may consider that 
functions of a single variable are vectors. We would then have, using z for the 
variable, i for the parameter, in the first case, an instance from expansions in 
orthogonal functions, 

(z) = ]Ln0n (2); 

in the second case, an instance from the expression of a function as a definite 
integral, 



= f 

Ja 
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When we apply vectors to functions we need to note there are further restric 
tions on account of the problems of analysis. We are concerned here largely 
with the problems of algebra only. 

When we undertake to find the coefficients (from the field) which would 
enable us to express a given vector as belonging to the complex of a given basis 
(finding the components of the vector), in case it is possible to express the vector 
thus, that is, in case the vector does belong to the complex, we say we resolve 
the given vector on the given basis. This is exemplified in the process of finding 
components of velocity, or force, or vector field, or Fourier coefficients, or 
solving a linear homogeneous integral equation. The conditions, in the func 
tion case again, that such resolution may take place, have been extensively 
studied in certain special cases. The study of such conditions as result from 
these analytic problems has been carried far by E. H. Moore, whose papers on 
General Analysis may be referred to. The definitions of norm, Hermitian 
square, generalized theorem of Pythagoras, etc., find their place in such 
investigations. 

(6) Vectors are further subject to conditions which are the main object of 
the present paper. These result from the formation of vectors of grade 2,3, 

, and their combinations into complexes, and combinations into complexes 
of mixed grades. These will be treated separately under the heads: the accretive 
process, the decretive process, the Hamiltonian process. The latter will also be 
called the product. 

2. THE ACCRETIVE PROCESS 

2. Alternants.* In general an expression is alternant when it changes sign 
with the interchange of two of its elements. The usage here is not contra 
dictory to this but supplements it considerably. As an example of an alternant 
let us consider n functions of n different variables, giving us n 1 different func 
tions 

on (Sj) (i,j = 1,2, ,n). 

Then the expression Y, &lt;i ( *i ) ;, ( *2 ) ; ( s n ) , where the subscripts i 
are the numbers 1,2, , n in some permutation, all permutations occurring, 
and the sign is + or according to the number of inversions, is an expression 
which changes sign if any two subscripts are interchanged. It is therefore an 
alternant. 

The alternants we introduce are defined by the following conditions: 

(1) From any n vectors we may construct an alternant, written A n a\ &lt;x 2 

a n , called a simple vector of grade n . 

* The remainder of part I contains the results of papers read before the Society as follows : 
Chicago Section, December 26, 1913, The two fundamental operations of general vector analysis; 
December 31, 1909, On Hamiltonian products; Southwestern Section, November 27, 1909, 
Scalars of lineal products. 
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(2) The interchange of any two vectors changes the sign. 

(3) If any one of the vectors is in a complex, say a; == &gt;./ ft- where the 23 
may be used also to include the non-denumerable case 



then 

A n -OLi 2 Cii a n = X) a J -In 1 2 /3; 

It follows at once that if any two vectors are equal the alternant vanishes, 
that if any vector is in the complex of the others the alternant vanishes, and 
that to any vector may be added any vector in the complex of the others 
without altering the alternant. The distributivity property (3) is very 
important. 

(4) AI a. = a. The alternants begin with A 2 . We preserve the notation 
since it is valid in general theorems. 

(5) If A n a.1 2 oi n = then for some set of values x\ , .T 2 , , x n not 
all zero 

.Tl ai + .T 2 2 + + Zn CX n = . 

This makes either equation a necessary and sufficient condition for the other. 
It precludes using as the alternant denned above any alternant expression. 
For instance in quaternions we might use as A? a/3 the expression 
V 0(a)0(/3) where 6 ( ) is a linear vector function, but it would be necessary 
to choose 6 as non-singular. Otherwise the expression might vanish because 
a. or /3 happened to be an invariant line for the root . Similarly we might 
take as A 2 a/3 the expression 



C P[a 

\) a *J d 



but we should have to place restrictions upon what 4&gt; ( ) could be. Indeed in 
either example we can consider that we do not really have the simple alternant, 
but a function of the alternant, for they may be written respectively $F a/3, 
and $ [ a O ) /3 ( P(s)a(t)]. The function might vanish when the 
elementary alternant does not. We limit the meaning of A n otherwise, 
however, only by the postulates. 

3. Vector of grade n .* A basis of alternants of grade n gives a complex of 
grade n , and any member of the complex is a vector of grade n . A vector of 
grade n is indicated by V n It does not have to reduce to a single alternant, 
as for instance A z a/3 + A 2 78 may or may not reduce to the form A 2 ef . 
Many vectors of grade n do however reduce to single terms which are repre 
sented by the symbols A n . Such is the case to be defined, which is written 

*This is not the "polyadic " of some authors. 
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A n ai 0.1 a n+k where k ^ . Such expressions have properties to be 
proved which may be stated in terms of the permutation groups under which 
they are invariant. In case k = we might say that A n ai a 2 a n is 
invariant under the alternating group of permutations of its vectors, and 
changes sign for the remaining substitutions of the symmetric group. Other 
expressions constructed from an initial term by applying the substitutions of 
some group would be interesting and useful, but little study of them has 
occurred. 

4. Accretive process. The accretive process is now defined by the equation 

A p+q A p aifX 2 OtpAq PiPz (3q = A p+Q (Xl 2 dp ft /3 2 j3 9 , 

idler e p , q , are any integers. Hence if the alternants are interchanged the sign. 
is multiplied by ( - l) p . 

3. THE DECRETIVE PROCESS 

5. Scalars. The term scalar was first applied by Hamilton to quaternions 
which could be included in the ordinary number scale. It is used here in the 
same sense but with the further application to a process which gives numbers 
(marks) belonging to the field, from vectors, whatever the grade of the vectors. 
We shall use A Q to indicate such results. We define first: 

(1) A A n 0.1 oii a n A n ft (82 fln is a number or mark of the 
coefficient field. 

(2) AQ A n &lt;Xl 2 a n A n 01 j8 2 j8 B 

= ^1 A n /3i /So (3 n A n Oi\ CC* . 

This will be found to be a special case of a more general formula. In particular 

AQ a/3 = AQ (3a. 

(3) A*-A f ()A q () =0, p^q. 

(4) A - (P + Q+ )(R + S+ .-.) 



for any expressions P , Q, , R, S , . That is, A is distributive. 

Instances of the scalar process are found in the expression called inner product 
by some mathematicians. In geometric vectors it is usually defined to be the 
product of the lengths of the vectors by the cosine of their angle, which angle is 
defined for the space involved. In functions we have 



/& 
a/3 = a(s)p(s )ds. 

Ja 



We do not identify the scalar process A with the scalar process in quaternions, 
though the latter is included. For we might take A a(3 = Sad (j3) where 
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6 ( ) is a self-transverse (symmetric, self-conjugate) linear vector operator 
which has no zero roots. Such a form is implied in the integration with a 
matrix defined by E. H. Moore in his General Analysis. 

Scalars of vectors of grades higher than one are expressible by scalars of 
vectors of grade one. 

6. Orthogonality. If the scalar of two vectors of any grade vanishes the 
two vectors are said to be orthogonal. A system of vectors may be mutually 
orthogonal, as for instance a system of mutually perpendicular vectors in 
space, or a system of functions such as t = : V2 sin itrs , under the definition 



Xi 
cxi (s)aj (s)ds. 



Biorthogonality. Two systems of vectors a(i), 0(j) are mutually bior- 
thogonal in case we have 

AQ a(i)j3(i) ^ 0. 

Biorthogonality and orthogonality play an important part in the problem of 
expansions mentioned earlier. For instance they enable us to find the coeffi 
cients under proper restrictions. 

7. Decretive process. We are now in a position to define the general de 
cretive process. We define first the expression 

A n -\ OiA n 01 02 0n = ]C ( " 1 Y~ l AQ Cifii A n -i 01 n 

I 

where the i written under 0i n means that 0; is missing from the expression. 
That is 0i n means 0i 0*_i i+ i n , with the particular cases 

0i n means 2 n and 0i n means 0i n _i . This expression 

1 n 

is evidently a vector of grade n 1. The decretive process reduces the grade. 
We define further 

^4 n _i (A n 0i n ) a = ( 1 ) n - 1 A n -i aA n 0i B . 

We shall see later that we are here following a general law of permutations 
which is the same as the determinant law for change of sign, but we have to 
notice that in A the two vectors may be permuted without change of sign. 
If we iterate the process just defined we would have 

f*-Q OL% pi AQ Q?2 pj i j 

This is evident since the multiplier of A n ^ 0j n will come from two 

, i 
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sources; being, if we omit the signs, one way A 2 ft- AQ a\ ft-, and the 
other way A ai Pi A on fa . But if i &lt; j the sign for the first is ( ) i+ -&gt; + 2 
and for the second ( ) i+;+3 , hence they have opposite signs. It is easy to 
see by induction that for r iterations we have 



A n - r Oil 



A n -l Oir A n ft ft, 

( V rMrl Nr+r 

{ ) 



/ . 



, a A . a 

Oik Pm -ft-nr Pi 



a 

Pn 



where k = 1 , 2 , , r , m = ii, i 2 , , i r , i\ &lt; iz &lt; &lt; i r - 

The formula for operation on the right instead of the left is easily written. 
As an instance in functions, we may define A n J3\ - @ n to be the 

determinant 

^E & (* Al )&(**,) Ai(O 

a function of n independent variables. If now we multiply by ai(s ), we 
may produce the decretive process by forming the sum, with proper sign, of the 
integrals we have, when we set in turn s = si , s = s z , , s = s n . 

Since the vectors a enter the expressions above in an alternating manner 
we are warranted in the final definition, for the decretive process on two 
vectors of any grades, 

A n -r A r (Xl Oir A n ft ft 

= A n - r Oil An-r+I 2 ^n-1 r A n ft j3 n , T g U , 

and we can show that 

A n ^ r -A n () A r () = ( - Y^ A n - r -A r (} A n () when r^n. 
In either case if r = n we have the important formula 

A A n ! a n A n -fa- ft, == ( - )*( 



A a, ft- | 

(i,j= 1,2, 



,n). 



The expression just found is sometimes called the Grammian when the a s and 
/3 s represent functions. In this case we define 



fb 
a(s)(3(s)ds. 
_ 



For n functions the vanishing of the Grammian is a necessary and sufficient 
condition of their being linearly dependent. However, this is really because of 
the presence of the A n of the n functions. It is really what vanishes. It 
is evident now that we may rewrite the definition of the general decretive 
process in a theorem 



n - r A r Oil 



&lt;X r A n Pi " (3 n 

i+ir( - 1) A A r 
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It is to be observed that the sign is to be obtained by the determinant rule, 
since we pass ft, over i\ -- 1 preceding /3 s, j3; 2 then over i z 2 preceding /3 s, 
etc. The total inversions are then ii + t 2 + + *V + i" ( f + 1 ) This 
theorem is a particular case of a more general one to follow, which, however, 
is to be stated as a definition" of both the accretive and the decretive processes 
for all cases. 

The expression A A n () A n () is called the norm or square of the intensity 
of the vector A n , as for instance A A z a/3A z a/3 is the square of the in 
tensity of A 2 aft . An alternant divided by its intensity, if this is not 0, gives 
a unit alternant. 

By means of the two processes it is possible to construct the whole of a 
general vector calculus.* 

8. When A A n a\ a n A n /3i j3 n = 0, we know from the prop 
erties of determinants that if the rank is h, by proper addition of rows and 
columns we may reduce the determinant to one in which either n h rows or 
n h columns contain only , which in the present determinant will give 
vectors in these rows or columns that are in the complex of the vectors a. or 
the complex of the vectors /3, respectively. It follows that n h vectors in 
the complex of /3 s may be chosen so as to all be orthogonal to every vector a , 
or we may choose n h vectors in the complex of the a s which are all orthog 
onal to all the vectors /3 . It follows that the complex of the remaining h 
linearly independent vectors in the complex of the a s, and the remaining h 
linearly independent vectors in the complex of the /3 s, define two complexes of 
h dimensions, which have a common incident complex of h dimensions. We 
must not assume that the complex of order h of the vectors a can be identified 
with that of order h of the vectors /3 , since each vector of the first may be of 
the form j a;,- 7,- + &, j = I, , h, and the vectors of the other complex of 
the form j y t 7* + ,- , j = 1 , , h , where the vectors f and are orthogonal 
to all the vectors 7 , and all vectors f are orthogonal to all vectors . The A 
process leaves only the bilinear homogeneous forms I&gt;; ?/,- . It follows that 
the h vectors a are not necessarily expressible as in the complex of the h vectors 
/3, and vice versa. We may say then that the complex of the n vectors which 
determine a vector of grade n , and the complex of the n vectors which deter 
mine another vector of grade n , may have with regard to each other a rank h , 
which is the order of two sub-complexes, one from each complex, which have a 
common sub-complex of order h . In the geometric case it is easy to see the 
significance of these facts. In the function spaces, however, they are also sig 
nificant. When the general decretive expression vanishes, each scalar coefficient 
vanishes provided the vectors /3 are linearly independent. Hence all vectors 
a and /3 become subject to the remarks just made. 

* This was presented to the Society in a paper read March 26, 1921, On Hamiltonian 
products, second paper. The contents are incorporated in this memoir. 
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9. In the alternant A n ai a n we may substitute other vectors without 
changing the value of the alternant. These may be chosen so as to form a 
mutually orthogonal system if their intensities do not vanish. For it is evident 
that 



.1 Oil 



A 



is orthogonal to i and its substitution for oj 2 will not change the value of the 
alternant. We may now find a vector orthogonal to both ai , a 2 to substitute 
for a. z , etc. In short if we set 



a( = 



a 2 = AI Oil AZ Oil Oiz/ Ao Oil Oil , 

oi z = AI AI ai ai A 3 ai a^ a s /Ao A* ai a&lt;i A z a\a.i, 

we shall have A n a.\ a. n -- A n a( a n . 
The process would of course break down in case 

AQ 0.1 0.1 = 0, AQ AI ai Oiz AI ai 2 = 0, 

etc. But these are singular vectors and are excluded from the cases that are 
under consideration. This process of orthogonalizing vectors is of importance 
in the study of sets of functions, integral equations, etc. If the A process is 
defined as in the General Analysis of Moore, so that it is a Hermitian product, 
the vectors may always be orthogonalized. 

10. In case p is linearly expressible in terms of j3i , -,/?, since we have for 
any set of n vectors ai, , a n (writing merely a for ct\ a^ a. n and /3 for 

01 & .-.ft.) 



AI- A n 0&gt;A n+l /3p = = (A A n OiA 

- (A A n aA n ft 0n_ip)0 re + 

it follows that we have a means of expanding p in terms of the vectors /3i , 02 , 
, /? if AQ A n aA n /3 ^ 0; that is, if the set of a s has rank n with 
regard to the set of jS s. Transposing we have 

A a n A n /3i /3 n _i p 



P = 



A n OiA n )3 



A Q A n aA n (3i (3_ 2 |8 n p 

- Pn-1 ~ j T ] a 

AO A n aA n P 



We now set 

P* = (- ) i ~ 1 Ai A n -i 181 I3 n A n ! a n /A A n aA n 13; 

i 

/3 1 is called the complement of f3{ as to the vectors a. In case the vectors a. 
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coincided each to each with the corresponding vectors , then the complements 
would be called the adjuncts of the vectors 0, that is, the adjuncts are com 
plements in the complex of the s themselves. Evidently 

where 8^ is 1 if i = j , and if i j j . The expression of p now becomes 

This is the generalized form for the expansion of p commonly used in 
quaternions. 

The process used above for two complexes is called biorthogonalization of 
the vectors as to the vectors a . The complexes need not be the same, but 
must have their rank with regard to each other equal to the order of the com 
plex. That is to say, the vectors a. must be expressible linearly in terms of the 
common incident complex and other vectors orthogonal to the incident com 
plex. The same is true for the vectors . The vectors outside of the incident 
complex, however, may be quite different. As an example we may show that 
the sets of vectors 

0! , AI ai A&lt;i 0i 02 j AI AZ ai a&lt;L A s 0i 2 0s , 

AI A 3 Oil Ctz Us AI 01 02 03 04 , , 

Ci\ , AI 01 AZ Oil i1 &gt; AI AZ 01 02 AS Oil Oiz Ois , 

AI AS 01 02 03 Ai Oil 2 3 &lt;X} , 

are biorthogonalized sets. 

11. We may now give the complete definition that states the result of using 
either the accretive or the decretive process on two alternants* (counting 
AI as an alternant by definition). It could not have been stated before 
since some of the expressions on the right would have to be defined first. We 
define for all i , j and any vectors a , , 

Ai +] -zk Ai ai otiAj fa fa 

X (A A k ail a ik A $ J1 ftj 

where f we have Ic ^ i , k ^ j , that is,i+j-2k^ \i - j\. The sign may be 
determined directly without using the factor containing the sign, by counting 
the total number of inversions, provided however that in the scalar factor 
Ao 00 the vectors a precede the vectors . This definition includes all 

* No special name is needed, as the next division shows. 

t This X sign merely means that the expression should be written on one line. 
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the others given, as particular cases. In case k = \i , i = j , and (3^ = a.i a , 
i a = 1 , 2 , , i, the expression vanishes. In case k = we have but a single 
term, the sign being + . If k = we have the accretive process. If the 
order of the two alternants is changed it is evident that the result is multi 
plied by ( - ) ii+k . 

The vector just defined, of grade i + j 2k , is the alternant which defines 
the region or space of order i + j 2k containing all the regions of the 
alternants of order i and j determined by omitting common incident regions 
of order j . 

4. THE HAMILTON PRODUCT 

12. Hamilton product of two vectors. Let V n and V m be any two vectors of 
grades n , m respectively, that is, in the complex of a system of alternants all 
of grade n and the complex of a system of alternants of grade m respectively. 
Then we define V m+n ^ k V m V n = A m+n -&lt;ik A m A" n x m y n , where V m = Y. %m A m , 
and V n == Y.y n A n. This merely states the distributive law. We now 
define the Hamilton product (also distributive) for two vectors of grades m , n , 
indicating it by the sign * temporarily. We shall see later that no special 
mark is needed. We remember a scalar is not a vector, though written AQ . 

V m * V n = EtlT" 1 PW-2* V m V n , &lt; TO , &lt; W , V m * X = xV m = X* V m . 

From this definition we have first, since .li i i, 

a* 2 = A ai 0.1 + A* i a 2 

Since the two parts of this product are of different grades, there is nothing to 
interfere with our using the same symbols in front of the Hamilton product as 
selective symbols to indicate the different parts. That is, we may use 

AQ ai* a&lt;2 and AQ a.\ a% , 
An a.\ * 2 and AZ a\ ai 

indifferently. We shall see that this is a procedure which is general. 
Again we have 

a\ * AZ a&lt;i 0:3 = AI a\ AI 2 as + A% a\ AI a^ a?s 

= as A a.\ai 2 ^o ai 0:3 + A s ai a^ as . 

Since a\ * A az a z = i A a z as, by adding, we have 

a\* (2* a 3 ) = i AQ o 3 2 ^o i s + a? 3 A a L a-&gt; + A s OL\ a* a 3 . 

This product consists then of two vectors, one of grade 1, the other of grade 3. 
If we use AI and A 3 in front of a Hamilton product as selective symbols, then 
we may define 

AI a\ * ( a-2 * as ) = a\ AQ 2 s 2 AQ i 3 + a 3 AQ a\ a 2 , 

AZ Oil * ( Ct2 * 3 ) = A S a\ 012 3 . 



208 j. B. SHAW [October 

Evidently 

AI i * ( 2 * a s ) == Ai a s * ( a-2 * i ) . 
We have 

ttl * ( 2 * 3 ) = AI Oil * ( Oil * &3 ) + ^3 Oil 2 3 = ( J * 2 ) * 3 

The vector of grade 3 in this product is familiar. The vector of grade 1, how 
ever, is a new function of a\ , a%, and 3 . We may indicate it as a function of 
of i, a.1, 3 by using the same symbol, omitting the multiplication sign*. The 
alternants we have already written in that manner, the mere juxtaposition of 
the vectors in A n a\ a n not implying up to this point anything in the 
nature of a product. It has meant so far nothing more than a certain defined 
function A n (a\, 2 , , a n ) of n vectors, of first grade. Hence we will 
now define a new system of functions recurrently: 

,, s A n a\ 2 dn+li = A n Oil A n -\ &lt;X 2 n+2i 

+ A n 0.1 A n +\ &lt;Xz OCn+li, &lt; H , 

where ^= i . (The last term vanishes when i = , since A n +i a 2 n = , 
for n + 1 &gt; n . ) 

(2) AO oti az Oizm = AD ai AI a* a^ m 

For instance 

AI &lt;xi oii 0:3 = AI a\ Ag oii G.Z + AI on AI 0:20:3. 

In the case of functional space the number i indicates the number of integra 
tions we perform, there being in all n + 2i parameters which are set equal in 
i pairs in every possible manner, and then each pair integrated, leaving in 
every resulting term n parameters as indicated always by the subscript of 
the A . We see that now A has ceased to have only an alternating significance, 
retaining that meaning when the number of vectors following is the same as the 
subscript on the A . But when the number is greater (it never can be less), then 
A no longer is an alternating sign. The functions, however, do permit certain 
permutations depending upon the grade, as will be shown. 

13. We shall now prove that the definition just given for A n , when the number 
of vectors exceeds n , leads to an expansion in alternants of grade n . That is, 
we shall prove the theorem: 

A n -zi ai oi n = Z) ( - )* Sa+i A a h ov 2 or; 2 . 

X Aj^-zi -on an ( a = 1 , 2 , , 2i ) . 

WI-JM 

As a matter of fact again the signs follow the determinant rule, provided we 

place the subscripts after AQ in natural order, that is, have ji ^= j% ^= s j 2 i . 

For let us suppose this formula holds for all values of n up to and including 
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n = t , and for all values of i , such that 2i S t . Then it follows that 

AI i a&lt;&gt; a t = ai AO OLi at a&lt;i AQ ai a t 

+ + a t AQ ai a t -i 

whenever t is odd, and further when t is odd, so that t + 1 is even, from the 
definition in 12, 

AQ Oil 2 Oit+l 

= AQ a\ AI a-i a t +i 

AQ Oil 0; 2 AQ 0? 3 a t+ i A Q tti &lt;X 3 AQ a 2 4 &t+l 

Now by definition whether t is odd or even 

At+i-2] oiia z at+i 

= At+i-zj &lt;xi A t +z-2j ai at+i + At+i-zj a\ A t -&lt;u a 2 a t+ \, 

where 2 j 2i t + 1 . But each term on the right contains a case that comes 
under the hypothesis since there are only t vectors concerned in the second A . 
Hence we may reduce the right-hand member, and 

Af+l-Zj Oil 2 OCt+l 

= Ati-z ai ^t2-z "2 ..... ~ 1+: 



where ki &lt; 3 &lt; &lt; k Z j-z , / i &lt; / 2 &lt; &lt; hij 
The first term reduces to 



where r is the number of subscripts A - i , A 2 , , k r &lt; A - o in the term, and 
k ^ ki , kz, , k 2j-2 , k = 2 , 3 , , t + 1 in different terms . This reduces 
to 

2] Af+l-Zj 2 3 ..... Oit+l AQ Oil a-2 a k ^ t ( - )J +1+a+ "&gt; 
O.Klky-l 

in which the subscripts are now rearranged so that g &lt; ki &lt; k 2 &lt; &lt; &2/-2 
No k = 1 . In these terms o.\ is missing with 2j 1 others, or 2j including cti . 
The second term reduces to 



where no h is 1 , and hi &lt; h z &lt; &lt; Ao; . 

In these terms i is present, and 2j excluding i are missing. But these 
two sets of terms together give exactly 

At+lZj Oil Oil Oit+l 

Q a kl a kj . ( 
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and hence the theorem holds for the next value of n = i + 1 and for all values 
of j that are possible. Since, however, we know the theorem is true f or n = 1 , 
n = 2, n = 3, it holds for all cases of n an integer. The signs follow the 
determinant rule provided that under A the letters are in their natural 
order, that is, as on the left. We see now from the definition of the Hamilton 
product of two vectors of any grades that 

Oil* [A n 2 Otn+i + A n -i 2 Ci n +\ 

+ + .li or A a* otn+i] 

= A n+ i Oil fX2 (X n+ i + A n -l Oil A n Civ OL n+ \ 

+ + A n -i ai A n -i ai oin+i + 
= A n+ i a\ a n +i + A n -\ ai a n +i 

+ + AO or .-li ai a n+ i . 

Now we know already that 

Ci n -\* OL n = AI Oi n -\ Oi n + AQ &lt;X n -l Oi n 

and 

a n _ 2 * (_!* ) = A 3 Oin-1 OL n ~\ OL n + AI O! n _2 &lt;X n -\ Oin 

It is evident from the theorem just proved that we must have the general 
Hamilton product (in which the multiplier is set at the left) given by the 
theorem 

Oil * [ 2 *( *)] = A n Oil a n + A n -2 1 O&gt; n + . 

If the A s are inserted in front of this product they are understood as selective 
symbols that pick out the term on the right of corresponding grade. We have 
now to show that this product is associative, that is, we may group the vectors in 
the order in which they come, in any manner, and find that the Hamilton 
product of the groups is the Hamilton product of the whole set. For instance 



14. Associativity of the Hamilton product. We shall prove this by showing 
that whether we multiply successively a n by a n -i , then a n --i , , by i , or 
whether we form the product of a n by a n -i , , by on , and then the product of 
this by the product of ai multiplied successively by c^-i , ai_ 2 , , by ai , 
we have the same result. That is, we are to prove that 

\A{ Oil Oti -\- Aiz CKi O!j + ] 

* [ A n -i oi i+ i a. n + A n _i-2 a i+ i a n ] 

= A n a\ a n + A n -z i + 

In the first place let n = i + j . Then 
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where the subscripts written inside the parenthesis will indicate the missing 
vectors from 0.1 , , on . Also we have 

Aj-zm Oii+l Oii+j = X ( - Y h+m Aj-Zm (jl, , jzm ) A (X h Oij^ , 
Ai + j-z(m+p+q~) Ai-Zq 1 Oii Aj-Zm i+l Oii+j 



X Aj-zm (jl, , Jim ) A (Xi, &lt;Xi 2ri A &lt;Xj l Oij 2 

- Z ( - )^ + ^ + " +m+ ^ A i+j -z (m+P+q ) on at ........... a i+i 

tlttftpJlllm+p 

X AQ a^ on 2q A ah a } - tm A 

A p &lt;Xi lq+1 ai 2q+p A p oij^ m ^ &j 2m+p 
Let r = m + p + q , then we may condense the expression just written into 

53 ( ~ Y Ai+j-zr (il i2g+ P jl J2m+p}A A p OL^ 



If now we set q + m = h , p = r h , and holding h fixed let q and m take all 
possible values, we have from the above 

h=r 1=T&gt;, m=0 

2_/ Z A i+ j-2r Ai-zi OLi OLi Aj-zm OL i+ i ;+; 
h=0 1=0, m=h 

~ Zlih ( Y Af+j-zr Oil ..... Oii Oii+\ ..... Oij AQ A r -h ;, 

hi, hlt 

ai s Ar-h OLj^ (Xj t , 

where we must have s + t = 2r and r h : s ^ r + h . But finally summing 
as to h , we have exactly the sum 

/,r Ai + j-zr Oil Oii 0! i+l (Xi+j , 

which is the expansion of the Hamiltonian product of cti , az , , a i+ j in the 
order written. On the left the final sums give the Hamiltonian products of 
MI, ctz, , oii and or,-+i , , ca+j . 

We see now that the special use of a multiplication sign is unnecessary and 
that in all the functions we have built up, with the use of the A and subscripts, 
the array of vectors may be considered to be a Hamilton product, and the A a 
selective sign for the vector of the indicated grade. The remarkable simplicity 
of this result enables us to dispense with any very large collection of formulas 
for transforming expressions in this general vector calculus. We may break up 
by associativity any expression under an A and arrive at various reductions in 
this way. As instances 

AQ Oil Oiz &lt;*3 4 5 Ci = AQ ( Oil 2 #3 4 ) ( 5 



= AQ AZ Oil Oiz 3 4 AZ 5 6 + AQ Oil Oiz Oiz 4 A 5 Oi} 

AZ (Az aia z At & /3 2 j8 3 j8 4 ) 

= A z (ciiaz - AQ aidz] A j3u8 2 /3 3 j34 = Az Oi 2 4 4 )Si j3 2 /3 3 13 4 ) ; 

AQ Ctl Oiz 3 4 = AQ (Xi az AQ Ct 3 4 

AQ 0&gt;i Ct 3 AQ Oi-z Ci + AQ (Xi 4 AQ a 2 3 . 
Trans. Am. Math. Soc. 15. 
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Expansions of this sort are unlimited in number.* Other useful formulas are 
easily derived by combinations, as 

A n aA n _i @A n 7i y n + A n j3A n+ i aA n 71 j n 

= A n CiftAn 71 TnJ 

AI A n &lt;X L a n A n+ i p/3i @ n = ( - )" pA A n i 

On A n 01 fti - X 0; A A n Oil n A n P01 B J 

) 

AI a.\ AI a-i 0:3 + AI a&lt;i A?, 3 i + AI o?3 y! 2 a\ a? = 0; 

^4 2 1 -4 3 2 3 4 ^2 2 ^3 1 3 4 

+ ^4 2 3 /1 3 i Q! 2 4 AI OH As Oil Oil Oiz = 0. 

15. There are other ways by which we might have developed the subject. 
We might have taken a method similar to that of Clifford and defined the 
symbols a, /3, etc., which we have called vectors, as the hypernumbers defined 
by the ratios of geometric vectors; but this method (and others) has great 
disadvantages in the end. We might of course have defined a general or 
indeterminate product and derived from this expressions which would have 
been our A expressions. Some of these will be exhibited later. We should 
have difficulties also this way. From a long consideration of the problem the 
method of development used was settled on as the most desirable from a good 
many points of view. 

16. We proceed to develop a few theorems useful in applications. If we 
consider two complexes of order n each, defined by i , -, and /?i , , j8 n , 
and a vector p , the part of p in the incident complex of the two (say p ) is 
given by 

p A A n aA n 13 = AI (A n ~i pA n a) A n j3 . 

For it is easy to see that 

A n -i pA n ai a n = ^ A oiipAn^i ai ( ) i ~ 1 , 

t 

and therefore the right side gives ]* A a t -p a ! A A n aA n /3, when 
we set for the complement of j 






a n An 0i f3 n /A A n aA n -f3. 



But the form we now have is that part of p which lies in the complex of a 1 , a 2 , 
, a n , that is, in the incident complex. It follows that 



__Ai (An,.! pA n a) A n (3 
A Q A n aA n 



* Conversely every expression A m - ai &lt;*, may be expressed as a sum of products alone 
without any A s. See page 220. 
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is the part of p (its projection) exterior to the incident space. It may be also 
written in the form 



Ai [pA A n aA n @ (A n -i pA n a) A n /3 ] _ 

A A n aA n |8 

17. Another useful form is the following. Since AI A n -i fa fa-i A n a 
is linear in ai , , a n , we see that if we consider a set of vectors defining a 
complex of order r , a\, , a n , a n +i , , a r&gt; n ^ r , then 

A r +l [ AI AnI fa Pn-1 A n Ofi ] A r a\ a r == 

identically. This is true whatever the vectors /3. Hence for any vectors 

Tl ) &gt; Ifr+l 

AQ [A r +i 7l Jr+1 Ar+l AI (/!_! fa 

/3 n _i A n ai a n ) A r ai a r ] = 
identically. We may now change the order and have still identically 

AQ [Ar+l (A r &lt;Xl &lt;X r AI A n -l fa 

/?_! A n ai a n ] A r +i 71 7 r+ i ] = . 

We may however drop the first A T+ \ since A Q in front prevents any additional 
terms from entering, and then for the same reason we may insert A r as shown, 
arriving at a formula from which many useful formulas may be deduced: * 

A A r ai a r A r [Ai (An-i fa 

fa-i A n ai a n ) Ar+l 7i 7 r+ i ] = . 

18. A similar formula may be deduced from 

A x Ay Pi Py A X +y CX.1 a x +y , 

which is linear in A x a^ a.i x . If then we consider the set ai , , a x + v + z 
where z may be zero, we have the identity 



(A x Ay fa /3y Ax+y &lt;*]. a i+ y)x4 I+v+z U\ (Xx+y+Z = 0. 

Hence whatever the vectors 71, , y^x+y+z, we have identically 



AQ [Azx+y+z 7i Yix+y+z A^x+y+z 

(A x Ay fa faA z+u ! a x +y)A x + y+z ! &lt;x x + v +z] = 0, 
whence for all /3 s and y s we have 

A [A x+v+ z ! (Xx+y+z A x 

(A y fa faAz+y ai a x + y }]A- ix+y + 2 ji jix+y+z = 0. 
Particular cases occur for z = . 



* Volterra, Rice Institute Pamphlets, vol. 4 (1917), No. 1, p. 58 (3). 
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19. In the expression 

Afr+s-Zx AT 0.1 Oi r Jr+s Oil Ci r /3j j3 s 

2l Jo A x Oii 1 - OL ix A x OLj l Oij y /3 fcl 

Pk A 2r +s-2x &lt;Xi ..... &lt;X r Oil ..... r 01 ..... 0s 



the vectors a that enter the right-hand side are all different, and there are not 
more than r, hence we have 2r x y =i r or r = x + y . Further if the j 
set and the i set do not together make up the whole r set, then there are inevi 
tably duplicates in the last factor which would cause it to vanish. Hence the i 
set and the j set must cover all the r vectors i , , a r , though they may over 
lap, in which case in the last factor the overlap would be missing. In each A 
we may reduce with no alteration in value by substituting an orthogonal set 
for the vectors appearing there (indicated by accents). The factor A would 
then vanish if it contained any vectors j3 . Hence the vectors of the i set and 
those of the j set must define the same complex. They must therefore each 
be the whole set ai , , a r Hence for any value except x = y = r the 
expression vanishes, and when x = y = r it reduces to 



20. If we consider the bordered determinant 



Jo aB AQ a/3] 


Jo 


a0 2 


A afin 


Jo 7i 8 Jo 7i/ 


3j Jo 


7i 02 


Jo 7i 


Jo 7n 5 J 7nl 


3i Jo- 


7 02 


J 7n 0" 



we see that it may be condensed into 

( - ) ire(re+1) A A n+ i fijSijS, /? A^i 7i 72 7n . 
If we expand by Cauchy s theorem the determinant is equal to 

A A Q ad - X Jo 0i Jo Sjj Ay , 

where A is the determinant of /3 and 7 alone, that is, the one bordered, and Ay- 
is the minor of A produced by omitting the column containing & and row con 
taining 7y . Now interchange a and 5 and subtract. The first terms cancel 
and the others become 



2 J J 2 a8A 2 0i % A iy = 2Z Jo J2 a8A z 

Pi 7 y Jo Jn-i 0i n J n -i 71 7 ( - 

ft 7, 

This is easily reduced to 
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Hence we have the useful theorem 

A A n+i a@i p n A n+l 



j3 n A n+ i 0:71 j n = ( - )" A A 2 aSA*. A n j3A n 7 . 

21. We notice that 

A* &gt;&&*&lt;* = A n+l - (A&gt;fa-- P**a) + An-i- (4 B -/3i---/3*a) 
= ( - ) [An+i oft ft, - A n ^ aA n fa ft,] . 

Hence we have since A n+z aA n+ i a ( ) == , and A n - Z otA n -\ a ( ) = , 
a* A n ft.-.& a 

= ( - )" [An aA n+1 Oft j3 n - A n OiAn-i CxA n ft ft, ] 

= ( - ) [A aaA n j3 l n - 2A afr A n a/3 2 

ft, + 2J /3 2 J n /?!& /?] 
= yl n a/3i |8 B a . 

It is apparent then that if we multiply an alternant of grade n by the same 
linear vector on the right and left, we have a vector of grade n, which is the 
part of grade n of the product of the original product by this vector on the 
right and the left. 

Since we have A Q a/3i @ n a = A aaA /3i /3 n and since 

aa = AQ aa, and a A /3i I3 n a = aaA /3i (3 n , 
it is evident that for any expression 

-4, j9i jS,, (m^n) 

we have 

a*^ m j8i /?* = ^ m a/?i /3 n . 

A vector of grade 1 which comes at both extremes of any vector of any grade of 
a product may be set outside the vector symbol of that grade. This enables us 
to make reductions. For instance 

afi^a aAn fiy a. + aoiA Q j3y; 
afiyaS = aaAo j3y + aAn fiy ab . 

This theorem is useful in studying invariancy under rotation about linear 
vectors. The papers of Joly and M Aulay should be consulted. 

It is evident that continued application of the theorem is possible. For 
instance 

( ai a n ) V m Pi (l r ( a n ai ) = V m ai a n @i /3 r a n a L . 

22. Since the formulas we are to consider in this section are homogeneous in 
all vectors on both sides, we will consider that the vectors are unit vectors, that 
is to say for each vector we have 

& = A, & = 1 . 
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We may introduce multipliers easily to produce vectors not unit vectors. 
We will occasionally indicate the opposite of a vector by a stroke over the 
symbol, thus 

a = a. . 

We have then aa = 1 . We shall use the stroke extended to cover more 
than one vector to indicate that the order is to be reversed and all the signs, 
thus 



= a n 



This is called, in quaternions, taking the conjugate. 

This operation will not affect the form A i a n , for this vanishes 
when n is odd, and if n = 2??z we have by definition of A i azm 

AQ a\ ai m = ^ ( ) p A a il a jl Ao a,-, &lt;Xj t A a&lt; m a im , 

where the subscripts are so arranged that i x &lt; j x (x == 1 , 2, , m) and p is 
the number of inversions in the total collection of subscripts so arranged. It 
simplifies the count a little to arrange either the subscripts i or j in their 
natural order. It is evident now that as each factor may have its vectors 
interchanged, and as a change of both signs does not affect the value, we will 
have in any case 



Oil On = AQ (Xi 



(X n 



Hence if we remember the expansion of the form A m on - a n , we see that 
as every term is an alternant of grade TO , and as 



we shall have 

-*-7n *-* 1 ^~*-n "~~ \ ) *-YTL * ^-1 * * (-*n 

Hence we have the four forms for any product TT = a\ a n , according to 
form of n , 

Aim TT -- A^m TT&gt; A^m^-i 7T = A^m^i 7TJ 

-^I4m-f2 7T := A^m-^z 1"J -^i4m+3 7T = ^4 m _|_3 7T. 

It follows that 

7T = (v4o -^1 AZ -}- AS + ^4 A 5 ) 7T . 

Hence 

! (TT + 7r ) = (-4o + A + ) TT if TT is of even grade, 

^ (TT TT) = (^2 + A$ + )ir if TT is of even grade, 

| (TT + TT) = (.4i + &gt;4 5 + )TT if TT is of odd grade, 

^ (TT TT) = (^ 3 + AI + )TT if TT is of odd grade. 
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We may also state the theorem thus : * in A , A , we may reverse the order 
of the product, in A\ , A 5 , we may reverse the order, in A% , A s , and in 
AS, AT, we must change the sign if we reverse the order. We may also 
state it thus : the substitution ( 1 , re ) ( 2 , re 1 ) ( 3 , re 2 ) leaves invariant 
A , AI , , AI , A 5 , and changes the sign of A 2 , A 6 , , A 3 , A^ , 
Again we notice that if n is a vector of grade m , then 

na = Am+l n& + A m -l M &gt; M = A m+ i afJL + A m -i a/JL , 

since all other grades vanish, and by the formula in 11 we have when we 
interchange the vectors 



an = ( } m [A m +i M A m -i 

na = ( - } m [A m +i an - Am-i an]. 

This gives us at once 

na + ( - } m an = 2A m+ i fj.a =-- ( - )" 2A m+ i an , 
/*-(- T an = 2A m _ l na == (- ) m ~ l 2.4 m -i an . 

We have from this, reverting to TT = a\. a n , and writing TT S = A s TT for 
brevity, 

ira = X) TT S a = ^ ( ) 3 [A s+i air s A s -i 7r s ] . 

We also have 

air = X a7r = X [A s +-i a-K s + A s -i air s ]. 

It is clearer if we write these out in full, retaining only the terms actually 
present, 

air = AQ airi + A\ a (TT O + 7T 2 ) 

+ A 2 a(7Ti + TTs) + A 3 a(lT2 + 7T 4 ) + , 

Tra = AQ airi + A\ a(ir 7T 2 ) 

- Az &lt;x(TTi 7T 3 ) + A 3 a(TT 2 ~ 7T 4 ) + 

Of course when TT is even all the even terms are zero in these, and when TT is 
odd all the odd terms are zero. We have from these, by combining, 

| [ air + ira] = A Q airi + AI air + A z air^ + AS air?, + , 
i [air -- Tra] = A\ air* + A 2 airi + A 3 air^ + A air s + . 

We have a number of interesting results from these. For instance, taking A 
of both sides we find that 

AQ Tra = A air . 

Hence A is invariant under the substitution (1,2, , n) . This substitu 
tion taken with the other which leaves A invariant gives a group of order 2n . 
* Since A (ir - w) = 0, A 4 (* - T) =0, etc. 
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Since in the expanded form, where n ~ 2m, we see that there are 2 m ( m \ ) 
substitutions which leave the individual terms unchanged, and there are 
(2m) ! substitutions altogether, there will be in the development of 

AQ ai a n 

= (2m - 1 ) (2m - 3) 3 1 terms. 



2 m (m!) 

There will evidently be for n = 2m vectors of grade 1 , J ( n - - 1 ) ! different 
expressions AQ a\ a n . 
Again we have 

A\ Tra == AI air 2Ai cnr 2 = - A\ air + 2cnr ; 

AZ Tra = A 2 air 2A airi = -- A z air + 2 Jo a-7r 3 ; 

AS Tra = AZ air 2A 3 air = -- A 3 air + 2J 3 air*; 



A m ?ra = A m air 2A m air m i = - A m air + 2A, n a7r,, i=F i, 

where the upper signs are used if m is odd, the lower if m is even. Returning 
to our previous procedure we have 



(j,a(3 = /jiA a/3 + /j,A 2 a/3 = nA a/3 + 

+ A m M^2 a/3 + A m+ nAz aj3, 
= nA af3 + A m - 2 (A a/3)jn 

- A m A 2 a/3M + A m2 J 2 a3 



(A 2 a/3) /* + A m (A z a/3) n + A m+ , 

We have at once 

/ua/3 = a/S/z 2A m (A z a@)n, 
whence easily 

7ra/3 = a/Svr - 2^ A m (A 2 a/3)7r m . 

We need not proceed in this fashion, for it is simpler to make use of the con 
jugates, thus: let 

TT = ai a n and T = ^ f3 p ; 
then 

TTT = ff = X ( ~ )" m(m - fl) A n TW, 
and since 

T7T = 2Z A m T7T , 

we have the general formula 

TTT TTT = X A m (TTT ( - ) 
Now by expanding each expression in the form 
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and similarly for T , and cancelling all terms that must give , we arrive at any 
of these forms directly. The terms retained are of the form T P TT S where 

m ~\p s\ , p + s 5= TO . 
If we take conjugates of the expression just written we have 

VT + TV =-- Z A* [ ( - ) }m(m+1) "T T*] . 

By combining these formulas by addition and subtraction we arrive at a 
variety of forms. 

Applying the same method to three products as factors of a single product 
we see that 



We combine this with CTTTT = ~^A m airr to produce various formulas. In 
particular let a and T be vectors of grade 1 ; then cr = - &lt;j and r - T , so that 



Tir&lt;r ---- 
We have then 

CT7TT T7T&lt;7 = ^A 

Remembering the grades of cr and T , the only possible grades for the [] which 
will not give zero terms are the grades m 2 , m , m + 2 . The bracket there 
fore reduces to 2ir m for the upper sign and 2 (-7r m _ 2 + 7r m+2 ) for the lower sign. 
Hence 

CTTTT + TTTff = 2^ A m &lt;77T m T , 

~ T7TCT = 2/. A m [ cr7T m _o T + CT7T m+ 2 T] . 



We finally reduce these as follows : 

&lt;77TT + T7TCT = 2^ ^m V [ A m+ i TT m T + A m -l 7T m T ] 

= 2^ ( ) W ^ O t^m+l T1T m .4 m _i T7T m ] 
= 2^ [lC m AQ &lt;7T A m ff A m -\ T7T m 

A m TA m -i (77T 



CTTTT TTTCT = db 2^A m (Ai &lt;JT ) ( 7T m _ 2 + 7T m+2 ) 

These formulas give the effect of transposing two vectors in such a product. 
They are very useful For instance when we introduce V it will be seen that 
we may set V for a , and for V-4o r ( ) + rA V we may write 2&lt;f&gt; ( ) . If 
then we set ir l m for the value of TT when in the vector of grade m in each term of 
the expansion for ir m (say A m 71 7) we write &lt;J&gt;Q, thus: 

A m (f&gt;o ( Tl ) 72 7m + A m Ji &lt;/&gt; ( J 2 ) 7m 

+ + A m 7172 &lt;Ao (7m), 
we have 

VTTT + TTTV = 2irA Vr T 2^ TT^, 
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where V acts only on T. The expression A VT will be called later the 
divergence of T . Likewise A z VT will be called the curl of T . We see then 
that when curl T = 0, VTTT = TTTV We cannot elaborate on these, but this 
single example will show how simple the transformations are without any 
great display of formulas of reduction.* 

A few cases need to be written down, however. They will explain them 
selves. We use TT and T as products of grades i and j respectively, and a as of 
grade 1 : 

TTf TJ = (- Y j [A i+j - At+j-s + Ai+j-* - }r j -jr i ; 

f (TTi Tj + ( - Y J Tj TTi) = [Ai+j + Ai+j-4 + JTTi T j\ 
I (TT; Tj - ( - ) ij Tj TTi) = [Af+j-z + A i+ j- 6 + ]lTi TJ-, 

Ai+j^iTTiffTj = Ai+j-l[lTiAj + l OTTj + (A;-i 7T; &lt;7 ) Ty 

+ ( - Y&lt;rA i+j TTiTj}; 

Ai-j+i TTi ffTj = Ai-j+i [7T; Aj-i VTj + (Ai-i TTiff) Tj 

+ ( - YffA i+j TTiTj]. 

23. We consider now more particularly A ai a&lt;i n In the first place 
from the development it is obvious that the form is a pfaffian. Hence if we 
square it we shall have a skew determinant of even order, that is, 

(A on 







AO &lt;xi az 


AO cxi as 


Ao 


ai a* n 





Ao 


ai a 2 


AQ (Xv Q. 3 


Ao 


Oiz OiZn 





A 


1 2n-l 


. . . 


Ao 


Oiln-l Otin 





Ao 


Oil 2n. 










It is evident in this form that advancing each subscript cyclically is equivalent 
to moving the first row and column to the last row and column, which does not 
change the value of the determinant. Reversing all rows and columns does 
not change the determinant. The number of positive terms in the pfaffian is 
one greater than the number of negative terms, hence not all terms can have 
their signs changed without changing the value. This shows again the permu 
tations possible in this form. If any vector is orthogonal to all the others the 
pfaffian vanishes. From the form O-TTT + TTTCT if m == we have 

AQ ffTTT + AO Tir(T = 2A TrA CTT . 

Hence if a and T are orthogonal the right side vanishes. Hence interchanging 
two orthogonal vectors in the A form merely changes its sign. 

Let AO () represent A on oi 2n , A (ii, i- 2 , -, i 2m ) represent 
A on oi Zn , m &lt; n, and let ( - Y +k A ik be the minor of 

AO ai oL k . It is clear that if we differentiate the expressions A Q ( ) as to 

*By means of these forms we can reduce every expression A m -on- to sums and differ 
ences of products alone, thus ^0-0:10:2 = 5(0:10:2 + 0:20:1), Afotia 2 = 5(0:10:2 o: 2 ai), 
= 1(0:10:20:3 + &lt; 
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A on a k partially, we have ( - Y+ k ~ 1 A (i, k). If we differentiate the 
determinant in the same manner we have 2 ( ) i+k A ik . Hence 

A ik = - A ()A (i, k}. 
From a well known theorem in determinants if 

A (ii , ii , , i m ; ji,ji, , j m ) 

is the determinant produced by erasing from the original determinant the rows 
indicated b y the i subscripts, and the columns indicated by the j subscripts, 
then we have 



In case the subscripts i and j are the same, A is a skew symmetric determinant 
also and is the square of A (ii, , i m ) . The determinant on the left in 
any case reduces to 

AQ (iiiz) AQ (iii m ) 



- A (imil) 

This vanishes if m is odd, giving a theorem as to A , and if m is even the de 
terminant is the square of a pfaffian, giving a series of theorems about expres 
sions built up from the forms AQ (i, j) in the same formal manner as the 
forms A are built up from A a.i a.j. We cite only the following: 

A Q (a, b)A (c, d) - AQ (a, c)A (b , d) 

+ A (a, d)A (b, c) = A (a, b, c, d) A (); 

AQ (a,b)A - (c,d,e,f) -- = A (a, b, c, d, e, f}A (). 

They are easily written and extended to all orders. 

24. Space does not permit further developments here, but enough has been 
said to make it easy to continue. The particular forms that are related to the 
linear vector operator, whose general form would be 



furnish a complete part by themselves. When the number of vector terms 
becomes infinite we have functional transformations, and functions of lines 
with related subjects enter. This development is deferred to later papers. 

5. THE DIFFERENTIATOR 

25. We should however mention here the differentiator (corresponding to 
the Hamiltonian nabla) related to a vector p = ^.T; a 4 , expressed as depending 
upon n parameters, i = 1,2, , n , 

V = X) 
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This symbol may be used anywhere just as if it were a vector, for all vector properties, 
and is supposed to differentiate in situ any vector dependent upon the parameters 
entering p . It differentiates also in situ any function or operator dependent upon 
these parameters. This one simple rule enables us to write out at once an 
indefinitely great number of forms for differential work. No special collection 
of formulas is necessary. The mere fact that V may follow an operand is a 
principle that introduces great simplicity. We observe here that in the case 
of functions the process of differentiation of a function of a line used by 
Volterra is a case of the use of a V in which the number of parameters is non- 
denumerably infinite. 
It is evident that we have 



d- () ---Ao-dpV 

for any operand dependent upon the parameters of p . (Note the parameters of 
p belong to the complex in which p is variable, and do not mean the coordinates 
of p, which would be usually more numerous and might be infinite in number.) 
A simple example of the use of V is shown in the following. 

If we have a pfaffian differential form w = ^,y% dxi we know that it may be 
reduced to a set of terms udv + pdq + rds + . In case there is but one 
term and that is dv, the expression is called integrable or exact. If the single 
term is udv it is said to be integrable with a multiplier, u~ l . The number of 
such terms was shown by Pfaff to be not more than half the number of param 
eters x, if this is even, and not more than half the number plus one, if the 
number is odd. 

Suppose w T e can reduce w to udv . Then we have by setting a = 

Vcr = VV^ + ^V 2 1} , where V 2 = W , 
whence 

A 2 V^Vfl = A z Vo"&gt; and A 3 a^cr = 0. 



This is a necessary condition of integrability with a multiplier. In case u = 1 , 
we have as the necessary condition A 2 Vo" = 0. Considering likewise the 
next form 



w = udv + ds = AQ dp(uvv + ys ) , 
we set 

a = u^Jv + V*, 
whence 

V0- = VV + W 2 v + V 2 s , 

A z Vff = A 2 VV, A 3 0-Vo- = A 3 VsVMVc, ^4 Vo"V&lt;r = 0. 

This problem has been studied by Cartan* who gives the conditions above in 
*Annales de 1 Ecole Normale (3), vol. 18 (1901), pp. 241-311. 
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scalar forms which, owing to the arbitrary differentials, are equivalent to 
these : 

AQ AZ VvAz 8\ p5 2 p = , 

A A 3 (r\?o-A 3 5i p5 2 p5 3 p = 0, etc. 

The expression A V&lt;r is called the divergence of a , and A z W is called 
the curl of a . 

We have other differential expressions, as for instance in the ordinary 
relativity theory in four variables, we consider A 3 Vo"2 called the curl of the 
2-vector (vector of grade 2 ) cr 2 . The complement or adjunct of this in the 
region of order 4 is called the Lorentzian of a . It is A \ A 3 Vo"2 A^ 0)4 . 

We may build up expressions with V which are covariant for unique and 
invertible transformations &lt;/&gt; (p ) . Such are 

A V2 avA-i di pd-i p , AQ AS irVirAs d L pr/ 2 pd s p , 

In fact the possibility of expressing a function in such wise implies certain 
covariant properties. 

26. In general if we write the form A dpa , and consider the equation 

AO dpa -= , with Az n +i crVtrVo" W == identically, 

we undertake to find v such that A Zn V^Vo" W = , where both forms 
A are alternants. From the latter, multiplying by dp and taking A 2n -i we 
have 

Vo" = = dvAz n -i 



A 



+ ( n 1 ) Atn-i yvdff Vo- + 

Set dv = 0, so that one solution is v = const., and the remainder of the ex 
pression furnishes an equation which must hold for arbitrary differentials of 
the parameters of p . Setting their coefficients equal to , we are able to find v . 
For instance let us have 

Xi x 3 dx-z + x\ 2 dx 3 + ( Xi + x 3 x 5 ) dx^ + x 3 x dx$ = . 

We have cr = Xi x 3 a* + Xi x 2 a z + ( x\ + x 3 x- a ) a 4 + x 3 x t a 5 , the vectors 
ai, , as being constant. It is easy to show that A 5 aVo Vo = and four 
variables will suffice. To find v so as to reduce the expression, we notice that 
since the form will be udv + rds 

a = 



Either V y r V^ is common to both, hence A t TjvffVv = . We assume the 
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form for such that 

V = 1&gt;l &lt;Xi + 2 2 + 03 3 + 4 4 + 5 5 , 

and substituting all values we have a vector of grade 4 which vanishes so that 
each coefficient of the alternants A a\ a 2 a 3 04 , A a\ a* a 3 5 , etc., 

vanishes. These are here 

x\ x b 3 x 2 x 3 x & v 2 + xi x 3 Xi vi = , 

xl Xi 3 Xz X S Xi V 2 + 3-1 .T 3 Xi l = , 

x 2 x 3 x 5 vs + x 2 x s x 4 Vi x 3 x 4 v 3 + ( x 3 Xi .r 6 .TI x 3 x 5 ) vi = , 
- x\ x 3 .T 5 5 + o-i .T 3 a- 4 vt + ( .T 3 .r 4 .TI x 3 x- )v 2 = . 

From these we have 

.TI TI .r 2 2 + x 3 v 3 = , 

X 4 2 a- 3 X 4 ^4 + .T 3 .r 5 1 5 = . 

These are satisfied by taking either v = x\/x$ , or .r 2 .r 3 + fl 4 , or 3-4 .r 5 . 

The equation A riper == means that dp is everywhere orthogonal to the 
congruence given by A 2 dpcr -= . This is called the characteristic congruence. 
If the equation is integrable with or without a multiplier, the congruence 
satisfies A 2 Vo" = or A 3 a^Ja = 0, and this makes it a normal congruence. 
That is, there is a function of n -- 1 of the parameters,/, such that A crVf = . 
The integral in fact is/ = const. The other cases, Ai Vcr Vo" == , etc., give 
special congruences worthy of study. 

27. The integral J Ao adp depends generally upon the path of p , that is, 
p is taken as an arbitrary function of a single parameter, s , p = p ( s ) , and dp 
becomes p ds . The terminal values of s furnish the limits. The integral 
evidently depends in general upon the form of the function p(s). If the 
initial and the final value of s are the same we say we have integrated around a 
loop, and indicate the fact with the sign $ . If we follow any path for which 
we have AQ &lt;rdp = 0, it is evident that the integral from any point to any 
other is zero, and such paths are orthogonal lines for the characteristic con 
gruence. In case they can be collected upon spreads (functions of one or more 
parameters) these functions are called solutions of the pfafHan form. 

If we follow a loop or closed circuit in the integration, and choose a spread 
of order 2 (a surface), attending as usual to singularities, we may introduce 
elementary areas and circuits, arriving at the generalized Stokes Theorem, 

fA adp = ffA A 2 v&lt;rA 2 dipd 2 p, 

over the area. This theorem still applies in case the line-integral is not taken 
over a circuit, if we can draw through the terminals of the actual path orthog 
onal lines as mentioned above so as to complete a circuit, since the integral 
along the lines introduced is zero. If the curl of a is zero at all points of the 
area the loop integral evidently vanishes. 
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i 

6. INTEGRAL INVARIANTS* 

28. Suppose that we have a general function linear in dp, Q(dp) . If we 
integrate this over a loop the form of Stokes theorem is 



where the accent shows the operand of the accented V- This may be con 
sidered to be the difference between two distinct paths from an initial point to 
a terminal point chosen on the loop, the double integral being taken over the 
enclosed area. If the paths differ infinitesimally it becomes the variation of 
the integral for a variation in path. Therefore if 

Q (A l v A 2 [] ) ==0 identically, 

the variation is zero, and the integral is called an integral invariant for the 
variation in path. We do not necessarily have to retain the same end points, 
for we may move either along any line which satisfies the equation Q(dp) = . 

When the integral is invariant it is called the integral of the exact differential 
form Q (dp) = . For instance, A-&gt; adp is exact, the integral being Ay. ap . 
In this case we have identically A 2 a(Ai yA 2 []) = 0, since a is constant. 
The lines which satisfy AI adp = are p so. . The path could be moved 
parallel to a. , leaving the integral invariant. 

29. It happens sometimes that an integral remains invariant when the 
whole loop is displaced, as for instance the strength of a vortex tube. The 
general condition is obtained thus: let the integral be taken over a region of 
order n (that is, there are n independent parameters and these are varied 
between appropriate limits), 

In = S n Q(A n d 1P d 2 p d n p), 

where f n means re integral signs or an ?*-fold integral (the iteration of an 
integral in order to procure an area-integral, or a space-integral, is incidental 
here and may not be necessary). Suppose now that we displace the region so 
that p becomes p + dp ; then we may consider the effect of this on any function 
of p by noting that 

5 = AQ SpV- 
Hence 

dl n = Ao d P VS n Q ( A n 8 1P d nP ) 

= S n A - dpv Q (A n ---} +f n Q(A n d l S P dp ---d n p} 



= /.4 OpV Q (An d lP d nP ) 

+ f n Q(A n dp An-1 V A n d 1 p d n P ) . 



* The contents of this division were included in a paper pres ented before the Society, 
Chicago Section, January 2, 1913, Integral invariants in general vector analysis. 
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This is called the first form. Integrate the last term by parts, combine, and 
we have 

5/n = S n Q (A n - VA n+l Spdip - dnp) 

+ S H [Q(An 5 P )] J ?l _l V A n d iP d nP . 

This is called the second form. The expansion of the last term gives a number 
of terms each of which is integrable once, and we have 

Sin = /" Q (An V A+1 5 P d lP d n p) 

+ Z f^ [ ( QAn 8 P ) Bi ~ ( QA n S P } Ai ]A n - i -d lP -- t -d nP . 

Let 6p follow a path so that dp = abt; then we have 
dl n /5t = f n Q A n V A n+l -ffdip - dnp 

+ ES^KQAn -&lt;r) Bi ~ (QA n &lt;r) At ]A n - l -". 
If we had retained the first form we would have 

S/n/* = / A oV Q (A n )+/" Q (An ffA^ V A n ). 
If now I n is an absolute invariant, we must have for the paths given by cr 
A &lt;TV Q (A n ) + Q (A n a An^ - tfA n )== identically, 
or for the second form 
Q (A n - v A n+l .ffAn&gt;-) + (QAn ff^An-i V A n ==0 identically. 

These conditions are very general and include every form as usually stated. 
As an instance, consider the integral J A Q rdp . This will be an absolute 
invariant for the congruence cr if 

Ao &lt;rVAo -T()+Ao- ra A V () = 0. 
The condition may also be written 

V -4 a r -- r A V o" or V(-4 err) = -- AI crA VT 

In case the boundary is fixed, then at the boundary dp = 0; if the integral 
is to remain invariant for all a , we see from the second form that as the second 
integral is now zero, and the first holds for all a , we must have 

Q, (A n - v An+i ) ==0 identically. 
For instance, in the example just considered we would have 

A r A-i V A 2 = identically, 
whence 

AZ VT = identically. 

The situation considered here is the extension from an integral over two 
distinct paths, with fixed terminals, to an integral over two regions of order n , 
with fixed boundary. Such an integral is indicated by Goursat* by 1% . When 
*Journal des MathSmatiques (6), vol. 4 (1908), pp. 331. 



I 
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the boundary is closed we have simply an invariant, called a relative invariant, 
indicated by J n . 

30. In a manner similar to that for Stokes generalized theorem we have 
Green s generalized theorem 



In = In+1 = f n Q(A n - d LP d n P ) 

= S n+ iQ (A n - v A n+1 -d 1P ---d n+lP ). 

The second integral is evidently of the type I d . Since now a relative invariant 
integral may be written as an integral of order higher by unity, the condition 
of its being invariant may be stated. That is, 



is a relative invariant if 

/ki = f n+l Q ( An V A n+l -d 1P -.- d n+1 P ) 
is absolute, that is to say, if 

A aV Q (A n v ^n+i) + Q (A n v A n+1 v"A n V A n+l ) = 0, 
or if 

Q (An- V A n+l V A n+2 a ) + (Q An V A+i &lt;r)"A n V"A n+l == 0. 
This gives us the condition (since A n AI v V A n+ % vanishes identically) 

[Q A n v An+i -a]" A n V"A n+ i =0. 
For instance in the case considered above 
A r A l V ^2 a"A^ y"A z () + A r A l V ^ 2 v ^ 3 &lt;rA, () = 0. 

Hence, reducing, A 2 vAi aAi VT = identically. 
31. Since we may always write 

A (TV ( Q An 0- ) + QA n ffA n ^ tr A n -2 V ^n-l 

= A aV Q (A n a ) + Q(A n a A n ^ V A n a), 
the condition above becomes 

A (TV Q (A n - &lt;7.4 n -l ()) + Q(A n (T An-! V A n (T.-ln-i) = 0. 

Hence if / = S n QA n is an absolute invariant, 



is an absolute invariant. 

32. From any integrand Q ( A n ) may be formed by two processes two 
other integrands, one by the process D , giving Q ( A n V A n+ i ) , and the 
other by the process E, giving Q ( A n aA n ^ ) . In case the latter form 

Trans. Am. Math. Soc. 16. 
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vanishes identically, we mark the integral I n of Q (A n ) with an e, thus /;, . 
In any case the second application of the E process gives an identically vanish 
ing form, hence from any / the E process produces an 7,1-1 Integrals are of 
four types with respect to these processes, that is, according to whether or 
not they can be marked with d or e . The effect of the D process on the inte 
grand is to mark the integral with d, of the E process to mark the integral 
with e . We need to prove that if an integral invariant has either character 
the other process will not destroy this character. That is, 

( jd) = (PY. 

E produces from I [ an 7;!li , that is, if the first is from an exact integrand, the 
second will also be exact when the first is invariant, and also will be invariant. 
For from 

Q (AnV ) = 
and 

Q (A n - v A n+l a ) + ( QA n cr } A n ^ V A n = 
it follows that 

(QA n -ffYAn-l V-" =0, 

which is the condition that the result have an exact integrand. To be in 
variant we must also have 

(QA n ffAn-KfYAn-2 V ==0, 

which is evident. Again when QA n a =0 and / is invariant we have 
also 

Q A n V A n+l a = 0, 

showing at once that Q A n v is of type E . That it is also invariant 
demands that 

Q An V An+1 VA n+2 a + ( Q A n V A n+l &lt;rYAn+l V A n = 0, 

which is evident. Hence the two operations are permutable when applied to 
an invariant, one giving Q A n V A n+l aA n , the other 

(QAnVYAn^V An---. 

If Q is invariant, however, the first expression is the negative of the other. 
The characters are then unchanged. If we start with an invariant and apply 
the E process we arrive at an I e of one lower order, usually not identically 
zero, and if we then apply the D process we have an integral of character I de 
of the original order. For instance, from A rdp we have, by D , 

- A A 
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by E , AQ TO- . Using the other process in either case we have from the first 
- A AZ - VrA 2 adp, from the second (A crr^ Ao V dp. In case now 
AQ rdp is an invariant the new forms are also invariant. 

The D process is a sort of generalization of getting the curl, while the E 
process fixes certain paths for integration, along which the lower integral may 
be displaced. 

As instances, let I ; = f A rdp, and let I have both characters d, e; that 
means 

A 2 VT = 0, 

and for a certain congruence given by dp = crdt , 

A TO- = 0. 

The expression is exact on account of character d; hence the integral is 

u = const. 



This is a function of p , and such that T = z^u . Hence we have also Vw 
everywhere orthogonal to the congruence line or, by the second condition. 
That is, o- gives lines lying in the spread u = c . 
Let 

I z = J J A 2 7T 2 A* di p C? 2 Pi 

then 

AQ - A-i 7T 2 A 2 VA S = 0, A A 2 7T 2 A 2 ffAi = identically. 
We have easily A 3 V^ = 0, A A z ir z A* &lt;rX = for every X; that is, 

A i a A-i 7T 2 = identically. 
From the first condition 

7T 2 = Az v ; 

hence 

Ai ffA 2 V = 0. 

Put dp for adt , and we have along the congruence 

Ai dpA 2 v = 0, or d == V ^o ? dp . 
This means, if we set &lt;t&gt; for the linear vector function V^l ( ) &gt; 

rf = &lt;f&gt;dp . 

We have then formally dp = (jr l d . The complete discussion is long. 

A comparison of the methods used here and those given in the coordinate 
treatments of these problems will show the enormous gain in simplicity. 

33. To solve the differential equation 

Q A n dpTTn-.! = 
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means that we must find a congruence of curves such that if dp is taken along 
any curve the equation will be satisfied, and that x(p)Q is an absolute in 
variant for the curves. That is, setting dp = adt , 

Q An-ff --=0, 
(xQYA n vA n+l a + (xQA n vYAn-i V A n = = 

= (xQYA n VA n+ i ff , 

= Q A n v An+i ff + Ao - &lt;rv log x QA n . 
Hence we must have 

QA n -a == 0, Q A n v A n+i a = t QA n , 

for some t, and t may be zero. For instance let Q be A dp = 0. The 
condition is then 



A -ff^ = Q, A A 2 
For a particular case let % A\ pA 2 ap , where a is constant. Then 
AZ V = 3.4 2 ap , A A z &lt;rpA* ap = , AI aA 2 ap = tAi pA 2 ap. 
These are satisfied if we take 

a- = tp + any vector orthogonal to a. and to p . 

Hence dp must be in the plane through the origin containing the common 
perpendicular to a. and p. Hence dp is in any plane tangent to a cone of 
revolution about a as an axis. This cone must therefore furnish an integral, 
that is, 

AQ pp = c 2 (A ap) 2 . 

34. A system of equations 

= A 1 dp = AQ 2 dp = = A n dp n 
is equivalent to a single equation 

A n -l dpA n ! = A n -i 1T n dp = . 

The conditions then reduce to 

A n -l TT n (7 = , A ffVAn-1 ( ) 7T n -- AQ ( ) V^n-1 0"7T n = tA n -i ( ) 7T n . 



We cannot enlarge on the use of these forms in the study of differential 
equations, but enough has been said to indicate the procedures. We will, 
however, show the application to differential geometry. 

II. DIFFERENTIAL GEOMETRY 

1. Let p be a vector dependent upon n parameters Ui, u 2 , , u n , thus 
defining a region of order n . For instance if n = 1 we would call the region 
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a curve, if n = 2 , a surface, etc. We have then 

p = p(Ui, W 2 , , ll n }. 

2. If now we let the parameters vary and assume that the function p is 
differentiable as to each parameter, indicating the derivative by a correspond 
ing subscript, we have 

dp = P! dui + p 2 du?. + + p n du n 

where p; = dp/dui , and the n vectors are in general linearly independent. 
We will indicate the adjunct of p, in the region by p i so that 

p = ( )-! A\ A n -l PIPI Pi-l P. +l Pn A n pi p 2 

Pn/A A n pi Pn A n pi p n . 

For convenience we shall write v = A n p\ p n so that v is an alternate vector 
of grade n, which is also a function of the point at which it is calculated. We 
also set (/ v} z = A vv , and Uv = v/Iv is a unit vector. Where v = 0, we 
have singular points. 
It is clear that 

AO pip 3 = 0, if i ? j , and A p, p = 1 . 
Further 

A n p 1 p 2 p n = V/AQW, 

which is seen by expanding each in terms of pi , p 2 , , p n and applying 
the theorems of determinants. 

3. The expressions Ag pt Py for i, j = 1 , , n are called the fundamental 
quantities of the first order. They are commonly represented by a y -. The 
corresponding forms AQ p i p 3 = a i} are the adjunct quantities of first order. 

It is clear that 

| 0tf | = ( - )&lt;n-i) Ao vv, \a ij \ = ( - 1 )*("-D AO-JV. 

4. The biorthogonal systems p, and p* are useful. Suppose for instance we 
have a system of values l s which we attach to p s giving a vector 

X = XPS L; 
then we may define a new system of values 

m = AQ p r \, 

and from the properties of the vectors p we have 

M = 2] in r p r = X Pr AO p r X = X . 

The coefficients m and the coefficients I therefore give the same vector when 
attached to the proper vectors. For the vector p in terms of p t - , and the vector 
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p in terms of p i we may introduce a new system of n independent vectors defin 
ing the region, and their biorthogonal system. As we are dealing with the 
vectors themselves we shall not be concerned with their mode of expression. 
Such questions however play an important part in the developments of Ricci 
(Lezioni sulla Teoria delle Super fide) and those who follow him. 

5. If we have a vector == A" r p r , and in the other form, = ]A" r pr&gt; 
then //p r is such that X T /Ip T = the orthogonal projection of on p r . In 
other words the orthogonal projections are given by AQ Up T . Similarly for 
the other form. 

6. If we differentiate AO p r p s = or 1 we have in either case 

AoprtP* + A p r (p s ) t = 0, 

a useful formula. From it we have the Christoffel * form 
( rs I 

[ = Aoprsp 1 = -- A p r (p ( ) s = - A Q p s (p ) r . 
( l } 

Evidently 

Prs = -- ^PtA Q p r (p*), + Ai UvA n+i 



The second termf in each form gives that part of p ra outside the region of order 
n . The other Christoffel form is 



[7]- 



7. Since A n +i pi p 2 p n p m = we have 

dA n+ i vp m = 0, or A n+l v(p m ) z = - A n +i v,p m , 
and likewise 

A n +l l&gt;Prs = A n +i V r p s - A n+i V s p r . 

There is usually a non-vanishing expression of first degree which is orthogonal 
to the region under consideration (pi, , p n ) 

We note that Uv might as well have been used here as v . We can construct 
n* expressions 

AO An-^i p rs vA n +l Vptu, 

which can also be written 

AO Pr v s A n +i v t p u , or AO p r Ai v s A n+ i v t p u , or AO p u v t A n +i p T v s , 

* This is easily reduced to the usual cartesian form by writing out p ra in terms of pi , , p n . 
For the other Christoffel form expand p ra in terms of p , , p n . 

f It is to be noted that in general p rs has components not linearly expressible in terms of 
Pi) , PI as for instance for a surface, which is imbedded in a three-dimensional space. 
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and in other forms produced by permissible interchanges of the subscripts. 
However this is a quadratic in p r , p u of the form A p r 4&gt;Pu , where &lt;/&gt; is a linear 
vector function. Its transverse would be 

A p T v t A n+ iv s p u . 
We will set 

n+l (Uv) t () -- N., (), 
n+l - (tfr). --N. t ---- N &lt;5 . 

The difference of these two forms has received a symbol due to Riemann : 

(ru, st) = AQ p r [(Uv) a A n+ i ( Uv) t 

- (Uv) t A n+ i ( Uv) a ] p u = A p r ( tN st ) PU . 

There is also another Riemann symbol * of order four, 

\ru, st} = A p r N s ,p" - A p r N, s p" = A OPr (eN s ,)p", 

where e ( N ) = f ( N -- N ) , N being the transverse linear vector function of N . 
A better form may be given these by utilizing a previous formula, f which gives 

{ru, st} = AoA 2 prp u A 2 ( Uv} s (Uv} t 

- AQ p rs VvA Put Uv + AQ p rt Uv AQ p ut Uv, 

{ru, st} = A Aip r p u A 2 (Uv) t (Uv) t 

= A Prs Ul&gt;A (p u )t T^V + AQ p rt Ul&gt;A (p") s Uv . 

From these expressions it is evident that the two symbols in each couple may 
be interchanged by changing the sign, and the two couples may be 
interchanged. 

8. The expression A dpdp = ^4 p r p s cht r du s is called the first funda 
mental form. In case the Riemann symbols vanish it can be reduced to the 
sum of not more than n squares, that is, new variables may be found in terms 
of which the parameters u may be expressed so that, for the new derived 
vectors p , we will have 

A -dpdp = Z^op p r(rfw ) 2 . 

The condition is both necessary and sufficient. If some or all of the Riemann 
symbols do not vanish, then it will be necessary to express A dpdp as the 
sum of not more than \n ( n + 1 ) squares. From one point of view this is 
equivalent to considering the region to be embedded in a region of not more 
than \n ( n + 1 ) dimensions in which we may take the directions of p z - as 
orthogonal. 

For instance a curve on an ordinary surface has a normal in the surface and 
* These may be identified with the forms commonly used by substituting and reducing. 
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also has the normal to the surface as a normal, or indeed all lines in the plane 
of these two. Hence the ds z is expressible in terms of three differentials in 
ordinary space which encloses the surface, in the form 

ds 2 = dx 2 + dy- + dz 2 . 

On the surface alone it would usually have terms of the form 

du 2 , diidv , dv 2 , 

and would not be reducible to the sum of two squares. 

9. We may improve the forms we have reached so far by the introduction 
of the operator which generalizes the Hamilton V On account of this general 
ization we shall use V in a general sense. It is defined thus: 

V - Zp d/dWi- 

It follows that d ( ) = A rfpV for any function of the parameters u only. 
Hence we will have d/du r = A p r V ; therefore we may write for ( Uv) T 

(Uv} r = A p r V Uv. 

The operator 

Ao () V l"v = $ 

plays a very important part in all these problems and gives of course the 
vector rate of change of UP in the direction given by the unit vector operated 
upon. With this symbol we may write the Riemann symbols in the forms 

(rU,St) = AoA 2 p r p u A2^p3^Pt 

with the understanding that s , t may be interchanged with r, u; 

{ru,st\ = A A z p r p u A z $p a $p ( . 

The number of Riemann symbols with two pairs of identical subscripts is 
\n (n -- 1 ) , with one pair of similar indices (n -- 1 ) (n 2 ) , and with all 
different ^n (n -- 1 ) (n - 2) (n - 3) , giving a total of T Vn 2 (n 2 - 1) . 
It is clear that for any four directions in the region we may have a symbol 
A AZ af3A 2 &lt;f&gt;7&lt;5 , which for unit vectors may be called the Riemann symbol 
for those directions. There is also, if we take differentials for the intensities 
of the vectors, a differential quadrilinear expression 

G 4 = A A 2 d P d" P A 2 



Since every symbol is invariant for a change of parameters, this is a covariant. 
We see at once that for any six directions we have a similar form 

G 6 = AQ dip d 2 p d s p A 3 &lt;S&gt; dt p $ d 5 p $ d 6 p , 

and there will be higher forms as high as G 2n . These are all covariants, and 
are generalizations of the Christoffel quadrilinear covariant G . 
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10. From these we derive covariant quotients, which give us in the first 
instance the Riemann curvature for the plane of di p , d?. p , and by generaliza 
tions, Riemann curvatures for the regions of di p , d z p , . These are 

K ,7 , s _ AO A m di pd 2 p d m pA m $di p$c? 2 P $d m p 

A m (ai a m ) - 



O A m dipdzp d m pA m ttl prf 2 P m P 

When we arrive at the covariant of order K n it is evidently the single coefficient 
of the Christoffel form for the same order. This number is called the 
Kronecker-Gaussian curvature of the space. 

11. If we construct the second differential of p , d~ p , we have two parts to it, 
namely, that part which has components in the region, and the part which is 
orthogonal to the region. Since we have identically 

A n+ i Uvdp = 0, 
it follows that 

An+i dUvdp = - A n+ i Uvd- p. 
But 

d z p = *piA P*d 2 p + A! UvA n+l Uvd z p 

= J^PiAo p i d- p - Ai UvAn+i dUvdp. 
We set now 

= AI UvA n +i dUvdp. 

It is evident that A Pi % = for i = 1 , , n, so that is not in the 
region. is called the vector second differential form since the coefficients of 
its expansion are scalar second differential forms.* We evidently have 



The coefficients AI UvA n+ i (Uv} T p s are fundamental vectors of second 
order. They depend upon the parameters and are not covariants, but they 
furnish a number of covariants. We see that 

A n+ i dUvdp = A n +i Uv. 
Also 

AI UvA n+ i dUvdp = - AI- UvA n+i Uv. 
But in general 

Al A n ai &lt;Xz Oi n A n +\ 7/3 1 /3 2 ]8 n 

= (-YyAo -A n aA n l3+ ( - ) A, (A n ^ 7 A n a} A n p, 
hence 

Ai UvA n+l dUvdp = d P A UvdUv + A L (A n ^ dpA n Uv] A n dUv 

= Ai (A n -i dpA n Uv)A n dUv, 

* Cf . Wilson and Moore, Proceedings of the American Academy of 
Arts and Sciences, vol. 52 (1916), pp. 270-368. 
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and also 



A! UvA n+l .Uv = t + Ai- (X-i SUv)A n Uv. 
But is orthogonal to all vectors p, hence A n -\ %Uv = 0. Hence 

A n dUv. 



If we go back to the theory of curves we see that if we write the differential 
arc in the usual form dp = ds a ,* then d 2 p = d 2 sa + dsda and as AQ ada = 
it follows that the part of d 2 p outside the region consists of part of the term 
containing da. If da c/3ds , == AI A n UvA n +i Uvfi c ds 2 , which is 
the part of /3 c ds 2 orthogonal to every vector pi , , p n . 

12. It is evident that the first differential quadratic and whatever is deriv 
able from it of a covariantive or invariantive character will be true of a region 
which is flexible but inextensible. That is to say, bending the region would 
have no effect upon the first fundamental form nor upon the quantities derived 
from it without differentiation. However the second fundamental vector 
form does not possess this character and demands a rigid space. This is due 
to the fact that d 2 p contains a part which is not in the region itself. For 
instance if we are dealing with a curve in three or more dimensions, the first 
fundamental form is merely ds 1 , and as long as we do not consider d 2 p we 
merely work in the line itself which for all such purposes might as well be 
straight. If however we consider d 2 p = d 2 s a + ds da where a. is the 
unit tangent (or better the hypernumber belonging to that particular tangent), 
then we must either abandon the line or consider da which makes the 
line really straight. 

It is to be noticed that the expressions (Christoffel symbols) A p rs Pt can 
be found from the region itself, since 

2A p T sPt = (^oPrPt)a + (A p s p t ) r (A p r p s ") t , 

all of which could be computed in a practical case from measurements inside 
the region. From these we can compute the expressions A -p rs p ( . The 
Riemann symbols are also computable in the same manner, since 

(rt, SU~) = (A PrsPt~) U (A p ru pt}s 

+ X) [^0 P h p k (A pruphA p st p k - A prsPhAo p tu Pfc ) ] . 

Covariants built out of these forms can likewise be computed directly in the 
space, and would be independent of any particular directions such asr,s,t,u. 

13. If a = Udp , then da has two components, one in the region, and one 
exterior to the region. The latter is /ds . The former is called the geodesic 
vector curvature of the curve of which a is the tangent. If it vanishes the 
curve is called a geodesic for the region. This may also be found as a curve of 

* Where a is the unit tangent vector. 
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minimal length. For let 

s = f -vU dpdp; 
then 

6s = I d^A dpdp = I Aod5pdp/ds = I A ddpa. 

Integrating by parts, the first part being then exact and vanishing for an 
extremal, we have as the condition of a minimum (or maximum) AQ 8pda = 
for every Sp in the space. That is to say, we have da wholly exterior to the 
space. We might also write this fact in the form 

A n -i daA n Uv = . 

It is understood here of course that the da is for a displacement along the 
curve. 

14. The covariant #4 can be written 



GI = Ao A 2 dipdz pA 2 da Uvdi Uv 

and is thus dependent upon the four directions of differentiation. Now we 
have 

dUv = d(v/Iv) = dv/Iv - UvA Uvdv/Iv. 

Hence as A 2 A n a\ a n A n ai a n = ,* and therefore A 2 UvUv = 0, 
G 4 = A Q A 2 di p dz p [ A 2 d 3 v d A v 

A 2 UP (da vA di v di v AQ d 3 v ) Uv]/A vv . 

If we let four tangents to the space, that is, four unit directions, be 
a\ , a-i, s , on , we have 

Ki = Ao A 2 ai ai A 2 $ dz $ a^/Ao A* a\ 2 ^2 3 0:4 . 

The Riemann symbol ( rs , tu) may be computed from observations in the 
space as follows: 
We find first 

(Ao p r tPs)u (A$ pruPs)t- 

Then we find in the region A p rt p su - A p ru p, t which is subtracted from the 
other value ( 12). The difference is ( rs , iu ) . The accents on p indicate that 
we are measuring so much of p rt , etc., as we observe in the region in question. 
In the first expression these accents are not needed, as the form shows that the 
other parts of the second derivatives drop out anyhow. If we multiply by 
du u and du t and sum for all values of the differentials we have 

d\ AQ d 2 Pr Ps di A f/i p r p s AQ J 2 p r dl p + AQ dl p r d z p s . 

* A! A n on &lt;x n A H en &lt;x n = S-4o A B _i on a n A n -i on " a n A t ai ocj , which will 

&lt; } 

contain (Ai on a,- + A^ a, a,- ) Ao A n -i i a n A n -i a\ a n , and thus all terms cancel. 



238 j. B. SHAW [October 

Multiply again by du s du r and sum again, and we have 



- AO dz d 2 p di d^ p + AQ d\ d 3 p L rf 2 d t p . 

This is the best form for this quadrilinear co variant, since by choosing the 
directions of the four differentials properly we may obtain any one of the 
Riemann forms. 

15. The Riemann-Christoffel tensor G ( can be written in the form 

(Ao A 2 aj3A 2 ViVzA 76)", 

where Vi acts on 7 only, V2 on 6 only, a, /3 are purely arbitrary, the double 
accent means that only components exterior to the space are concerned. 
a , (3 , 7 , &lt;5 are unit vectors. If this is to vanish for any a , /? , then 

(A 2 ViV 2 /io 75)" = 0, 
and this may be written 

where $ y is the derivative dyad of 7 , or 4&gt; y dt = d f 7 , the differential of 7 in 
the f direction. In other terms &lt;f&gt; y = AO () V 7 and &lt;f&gt; y = VA 7 () . 

If this vanishes for every 6 \ve must have (At V7)" = 0. This can 
vanish for every 7 only when all the Riemann symbols vanish, which inter 
prets the meaning of the vanishing of all these symbols. 

16. The Einstein gravitation equations translated into our notation are 

Gij = - ( Z AQ p ijp k )k Z AO p ik (p k )j 

+ (Ao Vpz)/ + Z ^o p ij (p k }k 

- AQ Vpij -- Z AO Pjj (p k )k ~- 22 A p i,, (p k )j 

- AO Vp ij Z AO p ik (p k }j + Z AO p ik (p k )j + Z AO p k pijk 

- AO Vp ij + AQ Vpij = (A Vpa)" = for all i,j. 

Therefore the significance is easily seen to be that the convergence of the 
exterior parts of the second differentials is zero. That is, if a is a unit tangent, 
AQ Vda" = . Hence the sum of the normal curvatures of n orthogonal 
geodesies at the point is zero. The Einstein curvature is 

Z 9* Gi) ~ Z AO P* V 2 p = Z" A p r p s AQ p 1 Ptrs. 

The Riemann-Christoffel covariant is written also without the accents (since 
components in the space lead to terms that cancel) 

AO At aj3A 2 Vi Va AQ 76 , 
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and if we substitute for 7 the adjunct of /3 , and sum for n linearly independent 
values j8i with j3 l , we have the Einstein co variant in the form 

Gas = 2] A A&lt;i afa A-i V: V 2 A /3 8 = for all a , 5 . 

This gives ^A A 2 afa A 2 V$ s @ { where V operates only on the fr . This takes 
the form 

Z ( lo &lt;/ a? A fa V - A 



which is another form of the potential. 

If we start from the forms (A p (r p s )! (A p ru p s ) f and, making 
s = u , sum, we have 

Grt = AQ VPrt + ^ AoPrt (p")s = (A Vpr)&lt; = AQ VPr&lt; 

as before. 

The simplest way to get the second derivatives is to choose paths in the region 
on the geodesies. There is then no normal in the region. If then the con 
vergence in the region of these geodesic normals vanishes, the space is of the 
gravitational character. 

If the Riemann-Christoffel covariants vanish for an ordinary space we know 
it is a plane, and a surface nabla of the normals (which would run out into 
three-dimensional space) all vanish. To be an Einstein space of two dimen 
sions it would be necessary for the convergence in the space of the geodetic 
normals to vanish. For a sphere, for instance, the normals all pass through 
the center, their differentials have any direction on the surface, and would 
not necessarily give a convergence equal to zero. A sphere is therefore not a 
possible Einstein surface. 

TENSORS 

17. As commonly used the term tensor means merely a multiplex of coeffi 
cients obtained when we substitute in a lineo-linear function of m variables the 
various vectors chosen m at a time from the set pi , , p n We shall use & to 
indicate a tensor, which from our point of view is an operator. To indicate the 
operands we will insert m parentheses after E . A tensor is an extension of the 
term dyad, triad, etc., used by Gibbs. We will write then a tensor of order 1 
as H ( ) , one of order 2 as S ( ) ( ) , etc. The first fundamental form is a tensor 
since it has the form A () () . From it we derive the set of quantities 
Ao p r ps- It is clear that all our A symbols, A m , furnish tensors. A 
tensor is a covariant tensor when it reduces to a form independent of pi , , p n - 
All the symbols A and their combinations furnish tensors that are covariant. 

A tensor is a derivative tensor, and of order higher by unity than another, 
when it is formed from the other by the process 

ZOO (Mo- v(). 
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For instance from the tensor (a covariant, as p\ , , p n are not in evidence) 
we derive 4 V ( ) where V operates only on . If = V-ST then we have 
VXA V ( ) If is a differential operator, as A ( ) V &lt;r , then we derive 

AQ () V 0" ^0 V() 

If we indicate direction by a subscript as o&gt;, then the derived tensor demands 
that a TS = ffsr This condition is necessary. It is further a sufficient condi 
tion that S be a derived or differential tensor. 

All differentials or derivatives taken in this manner without reference to 
the parameters u are called covariant differentials. The so-called contra- 
variant systems would be found by using the system p i . 

18. To make translations from the ordinary notations we have to notice 
that the covariant tensor A xv means the form ^,A xv Ao p x (} A p" () , the 
contravariant tensor A xy means ^A xv AQ p x () A p v () , and the mixed 
tensor A x means ^A x A p x ( ) A p v ( ) . 

The so-called product of two tensors of orders a , b is a tensor of order a + b . 
For instance the product A uv B T S means 

E A uv B r s A Q P U ()A - p (Mo P s (Mo P.O. 

In expressions where " dummy " indices appear, that is to say, duplicated 
indices, the expressions are summed as to the operational terms indicated, for 
instance, A s ura means A s ur3 A p u ( ) A p r ( ) A p s ( ) A p s ( ) . 
We will translate some of the current notations into these forms: 

Ctuv , w = [ UV, it)} == AQ Puv Pw, \UV, w} = AQ p uv p w , 

where the subscript 1 shows the operand of V- This is covariantive 
differentiation. 

With these indications it is easy to reduce any expression into the vector 
forms, and then to reduce the vector forms to compact expressions which 
from their forms alone show that they are covariantive. 

CURVES 

19. Let a be the unit tangent to a curve (that is, a function of one parameter 
s) , which gives AQ aa = 1 . We have therefore at once 

AQ a da = . 
Hence da is orthogonal to a . Let j3 be a unit vector so that 

da = c{ /3 ds . 
c( is called the curvature and /3 the principal normal. Usually part of /3 is 
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in the space of the curve, part not in the space. For instance, a curve on a 
sphere may have its principal normal lying in the embedding three-dimensional 
space. We may also write, then, 

da = (gi X + iii v)ds, 

where now gi is the geodesic curvature, n\ is the normal curvature, X is a unit 
normal in the space, and v is a unit normal out of the space. 

Since AQ aft = 0, differentiating again, AQ adfi = - AQ fida = c[. 
Hence we have for dfi (the dp is still along the curve) 



where 7 is called the binormal, and c i is the second curvature. Since the 
first term gives the entire projection on a we must have AQ ay = 0, and 
since AQ /3&lt;f/3 = , we must have also AQ fij -= . Continuing we find the 
trinomial, etc., and the third, etc., curvatures, giving the generalized Frenet 

equations: 

da = c{ fids , 

d/3 = (ci 7 - Cia}ds, 
dj -- (c C b -c [P)ds, 

f/V = - ci (m) K ds . 

When c\ = the other curvatures become indefinite but are taken as zero by 
convention. If c{ ^ and c( = 0, from the equations above 

A 3 dpd 2 pd 3 p = 0. 

The condition is necessary and sufficient. Hence, differentiating, 
A 3 dpd-pd^p = = A a dpd 2 pd 5 p, 

and all higher curvatures may be taken as zero. 
20. A congruence of curves is defined by the equation 

A z \dp = 0, 

where X is a given unit vector in the region.* Let us consider n independent 
congruences, one curve of each congruence through any one point, and at the 
point let all the congruences be mutually orthogonal. To indicate the 
orthogonality we shall use a. instead of X . Then 

AQ on ah = 8ih, 
and using 4&gt; ai to indicate AQ () V on, the operator that gives the differential 

* The notation of Ricci is as follows in vector forms: X i/r = AQ X; Up r , X^ = Ao \hP r , 
where iand h define different congruences; X;/ rs = Ao p r (Xi) a , 7.-,-* = Ai&gt; X,-(Xi)t. 
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of oti in the direction of the unit operand, we have 

AQ an &lt;/&gt;, ak = - AQ OLj ai ctk for all i j& j -^ k . 

Such an expression is called a coefficient of rotation for a reason that will appear. 
There are \n z (n 1 ) such expressions. We have also 

AQ Oih &lt;/,. &lt;Xl = if h 7* I . 

We may expand A p r 4&gt;&lt;* h P* in terms of ^A Q p r AQ oij p s A on 4&gt;a h otj . 
This is the Ricci formula 





but such expansion is not needed in the vector forms. 
21. Let 

e = j 2 -a,-&lt;M) 

be an operator which converts vectors of grade unity into vectors of grade 2. 
Of course it is a very special operator of this kind. The coefficient of rotation 

Tijfc = A Oii Otj AZ Qcik 

We see that Qdp = \ X!-4 2 a f don gives what may be called the instantaneous 
rotation for a displacement dp about a plane. That is to say, the plexus of 
congruences rotates about this plane through an infinitesimal amount given 
by the intensity of the 2-vector. (Rotation about a plane or other spread is 
possible in hyperspace.) We can look at the situation in this way: let 

T = AI 0-677 = - X ^rjAo aj a 

and AO oik T = A cr&lt;j) ak n = A Q adak/dt , A &lt;TT = . The coefficients of 
rotation are the components in the directions of the elementary bivectors 
A?. on cij of the bivector Qa^ . 

We can also express 9 a little differently in a form useful in the study of 
n-tuply orthogonal congruences.* 

Let 

r = Z A 2 V a i A an () = -- 2 A* V a, A Q ,(), 

the accents showing the operand of V Then 

6 - T = \A* () A, oi &lt; V = - - \Ai () A 3 ai Vi- 
The expression A 2 Vi is of course the curl of on . 

22. If /(p) = is a spread of order n - 1 , the normals are V/and form a 
normal congruence. Since f hk = f kh , we have 

AQ a h VA a k V / - A a k yA a h V / = . 



* See Shaw, Triply orthogonal congruences, these Transactions, vol. 21 (1920), 
p. 391. The present articles contain the results of a paper read December 31, 1915, before 
the Chicago section, On orthogonal congruences in space of three and more dimensions. 
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As V operates upon following a s we have from this 
A V/Oa,, at &lt;/&gt;, A) = 0. 
This might he written 

X (r*fc - 7ihk)A a t V.f = 0. 
Let the unit part of V/ be ; then the equation gives 

/4 ( &lt; a t a n &lt; aj . a/, ) - . 
That is, for the entire set 1 of curves in the spread and normal to it, 

Juki, = 7hk (h,k ~ 1,2, ,n 1). 

The equation also states that &lt;j&gt; ak is self-transverse (symmetric) so far as the 
spread is concerned. The condition is also sufficient. For when it is satisfied 
A J-&gt; a h a/, J- 2 Vo = ( * for all h, k =-- 1 , , n -- 1 . But 

AZ V&lt;* == ]C - r J A a n a, , 
hence we have 

.In a n Vot n = , 

whence fA-&gt; Vn == A* %a n for any ^. Choose = - V and we have 

A z V(ton) = n, to B == V/- 

From this we have also A 2 V log 1a n - - A% V n == AI a n IT , where TT is 
in the spread and is indeed &lt; n a n . 

23. Let us suppose now that a n is the principal normal of a system of curves 
through the point, that is, a; is the unit tangent, dati/ilx for dp along the curve is 
d a n , where / = 1 , , n -- 1 . That is, 

&lt;/&gt;, &lt;Xi = CiOLn for all i = 1 , , n -- 1 . 
Hence 

J 2 n 0a. i = 0, ^4 ay ; a, = -- ^4o a,- aj a,- = 

(i,j = 1, , ~ 1). 

This may also be written to parallel ithe treatment in three variables referred 

to above: 

AI &lt;XiQa&gt;i = C{ a n , 

A (X n 4&gt; a , a, = Cj. 

If the curves all lie in a spread as in the preceding article, 

V log t = &lt;p an a n r + xa n , 

AQ a n (j) ai Oij = AQ (X n &lt;t&gt; a/ Oii . 

In the general case of all the curves merely such that a n . is. their common 
principal normal, the sum of the curvatures is 

Ao Oi n X &lt;/&gt;, a,- (i = 1, , ??.) . 

Trans. Am. Math. Soo. 17. 
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which reduces to 

or X " - o oti&lt;t&gt; an ai = -- v/n Vo- 



This shows on the one hand that the sum of the curvatures of the normal 
curves is independent of the particular system chosen, if they are mutually 
orthogonal, but on the other hand it gives significance to the geometric con 
vergence of . Dividing by n -- 1 would give the mean curvature of the 
system. 

The second curvature of the curve with a, as tangent is 

&lt;/&gt; , = f t y, c( ,-. 
Since a and 7 are orthogonal, 

.l-&lt; ot-i 4&gt; an on = c" a, 7, , 
and we may write 

c" is the intensity of A% a, &lt;/&gt; an a,-. 

24. The spread of order n -- 1 is called isothermal when 

A w/ == = V 2 /- 

The resulting equations are simple. 

25. In case the curves in the region through the point in question are 
geodesic we have 4&gt; Ua a n = in tin- region, that is, for every / 

AQ oti 4&gt; a a n = . 
That is, 

A a n &lt;f&gt; a , OL n = 

for every i . This may also be written A a n a, Qa n = . This form enables 
us to state the condition in the form A l a,, Qa n = in the region, and this 
vector must lie outside the region therefore.* 
Further applications must be deferred. 

*See page 213. 
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REPRESENTATIONS OF A COMPLEX POINT BY PAIRS 
OF ORDERED REAL POINTS * 

BY 

W. C. GRAUSTEIN 

1. Introduction. A complex point z in a complex space of n dimensions, 
since it depends on 2n real parameters, can be represented by a pair of real 
points, X and Y , of the space. It is reasonable to restrict the choice of X 
and Y by the following conditions: 

A. The points X and Y representing z shall be ordered: X - Y . 

B. If 2 is a real point, X and Y shall coincide in z . 

C. If z is imaginary, X and Y shall be distinct; if X - Y represents z, 
Y -&gt; X shall represent z , the point conjugate-imaginary to z . 

D. The coordinates of X and Y , referred to a rectangular coordinate 
system, shall be analytic functions, without singularities in the finite domain, 
of the coordinates of z and z , referred to this system. 

E. The representation of z by X -&gt; Y shall be invariant under a chosen 
group of real point transformations; that is, the ordered points X -&gt; Y into 
which the ordered points A &gt; Y are carried by an arbitrary transformation 
of the group shall represent the point z into which z is carried by the 
transformation. 

If X &gt; Y is a representation of z which satisfies these conditions, Y &gt; X 
is also a representation of z satisfying them. The two representations shall 
be called inverse. 

There are two pairs of inverse representations satisfying the prescribed 
conditions which are well known: those of Laguerre in the plane, whereby z 
is represented by the real points of intersection, taken in the one or the other 
order, of the minimal lines through z and z; and those of Marie in the plane 
and in space, in which the real points of the representation lie on the line zz , 
have the same midpoint as z and z , and are at a distance apart whose square 
is the negative of the square of the distance zz.f The two Marie representa 
tions are invariant under the group of affine transformations, whereas those of 
Laguerre are unchanged by the direct circular transformations of the plane. 

This paper deals with the determination of all the representations X &gt; Y 

* Presented to the Society, December 31, 1919. 

t For a review of the literature concerning these representations, see E. Study, Vorlesungen 
uber ausgeivdhlte Gegenstande der Geometrie, erstes Heft; Ebene analytische Kurven und zu ihnen 
gehorige Abbildungen, Leipzig, 1911; J. L. Coolidge, Geometry in the Complex Domain (in press). 

Trans. Am. Math. Soc. 17. 
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of z which satisfy conditions A to D , and are invariant, in compliance with E , 
under a given group. In 2 the problem is formulated analytically for an 
arbitrary group. In the subsequent sections, 3-7, it is solved for various 
subgroups of the projective group, for the protective group itself, and for the 
group of direct circular transformations of the plane. Only in the case of the 
projective group are no representations obtained. 

Given a group, G , of direct transformations and the set, /S , of corresponding 
indirect transformations, it is of importance to determine, among the repre 
sentations invariant under G , those which are invariant also under S , and 
secondly, those which are inverted, i.e., carried into their inverses, by S . It 
turns out that all the representations invariant under G are invariant also under 
S , except when G is the group of direct circular transformations, or when G 
is either the group of motions or of direct transformations of similarity in the 
plane.* In the latter cases certain representations of the totality invariant 
under G are unchanged by S , whereas certain others are inverted by S (8). 
Moreover, it is only in these same cases that the representations are of excep 
tional nature in comparison with those obtained for the same group applied in 
a space of different dimensionality. 

2. Analytic formulation of the problem. Let (zi , z* , , z n ) , where 

Zi = Xi + iyi (i = 1 , 2 , , n) , 

be the coordinates, referred to a rectangular cartesian system of axes, of the 
point z , and let the coordinates of X and Y , referred to the same system, be 
respectively ( Xi , A" 2 , , X n ) and ( YI , F 2 , , Y n ) . 

Since the latter sets of coordinates are, by condition D , real analytic func 
tions of Zi , 2 2 , , z n , z\ , zi , , z n , they are also real analytic functions of 
the real variables x\ , y\ , x 2 , y z , , x n , y n . It is convenient, in light of B , 
to write these functions as follows: 



Xi = Xi + Fi Oi, y-i, , x n , y n ], 

Yi = Xi + Gi (xi , ?/i , , x n , y n ) ( i = 1 , 2 , , n ) . 
ByC, 

Gi Oi, 2/1, , x n , 2/) = Fi Oi, y i} , x n , - y n ] (i = 1, 2, , n) . 

Conditions A to D are thus covered by demanding that Xi and 7,- be of 
the form 

Xi = Xi + Fi (xi, yi, , x n , 2/ n ) , 

Yi = Xi + Fi ( xi , - t/i , , x n , y n ) ( i = 1 , 2 , , n ) , 

where F , F z , , F n are real functions of the real variables x\ , y\ , , x n , y n , 
which are analytic for all finite sets of values, vanish identically when 

_ 2/i = 2/2= = y n = 0, 

* In this summary of exceptions, one relatively unimportant case has been omitted. 
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and are furthermore such that the system of equations 

Fi Oi, 7/1, ,x n , y n } = Fi (a-i, - y\, , x n , - y n ) 

has no real solutions in y\ , y% , , y n except the solution 0,0, , . 
Let the equations 

(2) z i ---- r,-(zi,2 2 , ,) (* = 1,2, ...,n) 

represent an arbitrary group of real point transformations. In order that 
the representation A" &gt; Y of the point z be invariant under this group, it is 
necessary and sufficient that 

(a) x t + Fi (x(, y{, , x n , y ,,} 

= Ti (.TI + F l (xi , y l , ,x n ,y n ), 

(3) x n + F n (xi,yi, , x n , y n ) ) , 
(6) x t + F i (x(, -y(, ,x n , ~y n } 

= Ti (xi + Fi (xi, -7/1, , x n , - y n ), , 

X n + F n (Xi, ~ 7/1, ,X n , -?/)) (t == 1 , 2, , n) . 

Thus our problem, finally formulated, consists in solving the system of 
functional equations (3) for the functions FI, F z , , F n , subject to the 
conditions stated under (1). 

If the transformations (2) are linear, as will frequently be the case, 

x t = Ti(xi,x 2 , , x n ) , 7/1 = T (yi,y z , , y n ) 

(i ---- 1,2, -,), 

where T is Ti minus its absolute term. Changing the signs of y\ , 7/2 , , y n 
then changes the signs of y\ , y 2 , , y n - Consequent!}^ the two systems of 
equations in (3) are, in this case, each equivalent to the single system 

/ 4 ) Fi (x( ,y(, ,x n ,y n ) = T (Fi(xi, y i} , x n , y n ) , 

, F n (xi,yi, , x n , y n ) ) (i = 1 , 2, , n) . 

3. Representations invariant under the group of translations. Inasmuch as 
the group of real translations is a subgroup of each of the other groups under 
which the in variance of representations is to be considered, it is advantageous 
to deal with it first. Equations (4) then become 

F i (x l + a l ,y l , ,x n + a n ,y n } = F i (x l ,yi, ,x n ,y n ) 

(i = 1,2, -,). 

Since t , a 2 , , a n , the parameters of the translation, are arbitrary, the 
system is satisfied if and only if FI , F 2 , , F n are functions of y\ , y% , ,y n 
alone. 

The representations X -&gt; Y of z invariant under the group of real translations 
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are given by 

T Xi = Xi + Fi ( y l , 7/2 , , y n ) , Yi = Xi + Fi(- yi, - y 2 , , - y n } 

(i= 1,2, ,), 



FI, FZ, , F n are arbitrary real analytic functions which vanish for 
= y 2 = ... == y n ana 7 ore swc/i ^a /ie system of equations 

Fi (yi, 2/2, , y n } = Fi(- yi, - z/ 2 , , - y n ) (* = 1,2, , n) 

no rea/ solutions other than 0,0, , . 
In the case of a group of linear transformations containing the group of 
translations, (4) can now be replaced by the system of equations 

,~ Fi (y{, 7/2, , y n ) = Tl (F l (yi, y 2 , , y n ) , 

, F n (y\, yz, ,y n )} (i = 1 , 2, , n), 



* 



in which Xi, x 2 , , .r n , x( , x 2 , , x n no longer appear. 

4. The group of homothetic transformations. The representation A Y is 
invariant under the group of real homothetic transformations 

z{ = pzi + ai , p ^ ( i = 1 , 2 , , n ) , 

if and only if the system (5), formed for this group, namely 

Fi(pyi,py, , py n ) = pF { (yi,y z , , y n ) (i = 1,2, , n ) , 

is satisfied. But then, regardless of whether p is unrestricted or takes on only 
positive values, F\ , F?, , F n must be linear and homogeneous in 

y\,y&lt;i, , y n - 

The representations X &gt; Y of z invariant under the group of either direct or 
general homothetic transformations are 

n n 

II Xi = Xi + XI kn yj , Yi = Xi - X hi Vi (* = 12, ,), 

3=1 3=1 

where the k^ are real constants subject only to the restriction that the n-rowed 
determinant \ka does not vanish. 

* The equations (5) are also necessary and sufficient conditions that the real analytic point 
transformation 

(a) y { = Fi(yi, 7/2, ,2/ n ) (i = 1, 2, , n) 
be commutative with each transformation of the group of real linear transformations 

(b) Vi = ^(2/1,2/2, ,!/) (&lt;&lt;- 1,2, ,*!) 

leaving the origin invariant. Accordingly, as we solve (5) in the subsequent paragraphs for 
various groups (&), we determine at the same time, for each particular group (b), the group of 
transformations (a), each of which is commutative with every transformation (b). Thus we 
find, for example, in 6, that the group of transformations commutative with the group of 
motions about the origin in the plane is 



where r 2 = y\ + yl and RI, R 2 are analytic functions of r 2 , not both zero. 
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The condition | ki, ^ is necessary and sufficient, not merely that X and Y 
coincide only when z is real, but also that the 2n equations II can be solved 
uniquely for xi , x 2 , , x n , y\, y*, , y n and hence the coordinates of z 
found in terms of those of X and Y . In other words, the correspondence 
between the ordered point pairs X - Y and the points z under a representation 

II is one-to-one without exception in the finite domain. 

5. The affine and projective groups. In order that the representation II 
be invariant under the n transformations 

z t =-z q , z( = Zi (q = 1,2, ,n), 

where in each transformation i takes on all the integral values from 1 to n 
except the fixed value q, it is necessary that &# = 0, i ^ j . If the resulting 
representation is to be unchanged by each of the transformations 

Z a = Z T , Z T = Zq, Z t = Zi ( q ^ T ) , 

where in each transformation q and r are fixed and i takes on the remaining 
values from 1 to n, it follows that ku = k^ == k nn . A necessary 
condition, therefore, that II be invariant under the affine group is that Ic^ = , 
i 9^ j , and & = k for all i . That this condition is also sufficient can be easily 
verified. 

If, instead of all real affine transformations, merely those for which the 
determinant of the coefficients is positive had been admitted, the result would 
have been the same. 

The representations X -&gt; 7 of z invariant under the group of either direct or 
general affine transformations are 

III Xi = Xi + AT/I , Yi = Xi kyi (i = 1 , 2 , , n) , 

ivhere k is an arbitrary constant, ^ . 

The two inverse representations for which k == 1 are the representations 
of Marie. 

If III is to be invariant under the projective group, it must be unchanged by 
the transformation 

z( = -, z i = - (i = 2, ,?i). 

Zi Zi 

But then the first of the equations (3) necessitates that k 2 = - - 1 . 

There is no representation X -&gt; Y of z which is invariant under the projective 
group. 

6. The group of motions. The treatment in this case varies with the dimen 
sionality of the space. 

A. n = 1 . The group of motions is the group of translations. According 
to I, the invariant representations are 

IVA X l = x 1 + y 1 $(y 1 ), Y 1 = Xl - yi $ ( yi ) , 
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where $ (2/1) is a real analytic function and the equation $(7/1)= - $ ( 2/1 ) 
is satisfied by no real value of ?/i except perhaps t/i = . 
B. n = 2 . The representation must be of the form 

(6) Xi = Xi + Fi (y lt 2/2) , Y t ^xt + Fi(-y lt -y^ (1=1,2), 
and be invariant under the transformations 

z{ == cos (j) zi sin &lt;j&gt; 22 + i , 2 = sin &lt;/&gt; 21 + cos &lt;j&gt; 22 + 02 

That is, FI ( 2/1 , 7/2 ) , F? ( y\ , 2/2 ) must be solutions of the system of equations, 
obtained from (5), 

FI ( 2/1 , 2/2 ) = cos &lt; F! ( 2/1 , 2/2 ) - sin F 2 ( y\ , y&lt;i ) , 
F 2 (2/i. 2/0 = sin &lt;f&gt; FI (2/1, 2/2) + cos 0F 2 (2/1, 2/2), 



where 

2/i = cos &lt;/&gt;2/i- sin &lt; 7/2 , 2/2 = sin 2/1 + cos &lt; 2/2 . 

For (j&gt; IT , equations (7) become the conditions that FI , F 2 be odd functions 
of 2/1 , 2/2 Consequently, if F is written in the form 

r, ( Fj(yi,y z ) + Fj(yi, - 7/2) , Fj(y\ 

fi(yi,yz) = 



(t=l,2), 

the first quotient on the right is even in y 2 , odd in y\ and hence identically 
zero for 2/1 = , whereas the second is even in y\ , odd in 2/2 and so identically 
zero for 2/2 = 0. Accordingly, 

(8) ^-(2/1,2/2) =2/i^ (2/i&gt;2/2) +2/2^(2/1,2/2) ( =1,2), 

where each of the functions jPj, F?, F2, ^2 is analytic and is even in j/i, in 
2/2 , and hence in 2/1,2/2- 

In the equations obtained from (7) by substituting for FI and F 2 their 
values as given by (8), replace by --&lt;/&gt;, y\ by 2/1 , and combine by addition 
and subtraction each of the resulting equations with that from which it arose. 
The system obtained is 



y(F( (2/i, 2/2) = yicosjF i (2/1, 2/2) - 2/2 sin &lt; Fl ( yi , ?/ 2 ), 
= ?/i sin 0Fi( 2/i, 2/2) +2/2 cos &lt;t&gt;Fl( yi 



y iF a (y{, 2/2) = 2/iCOs&lt;/&gt;F 2 (2/i,2/2) + 2/2 sin 4 F? ( T/I , 2/2 ) , 
2/2 -P"i (2/1,2/2) = - 2/i sin 0^2 (2/1, 2/2) + 2/2 cos FJ (?/i, 2/2) . 
These equations reduce, for &lt;f&gt; = ir/2 , to 

(10) ^2(2/1,2/2) = --^(2/2,2/1), ^(2/1,2/2) =^(2/2,2/1). 

When the values for F 2 and ^2 given by (10) are substituted in (9) and y 2 is 
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set equal to zero, the four resulting equations yield immediately the identities 

F( (yi cos &lt;j), yi sin 0) == F{ (y sin &lt;, yi cos &lt;), 
F [ (yi cos &lt;f&gt;, y l sin &lt;f&gt;) == /?"/ (^ sin $ , y l cos ) , 

in the independent variables y\ and &lt;/&gt; . Consequently, F{ (yi, y 2 ) and 
-f 1 ! ( 2/i , 2/2) are symmetric functions of yi , y 2 , and (10) becomes 

(11) F 2 ( yi ,y 2 )= -Fl(y l&gt;y ,}, ^ (2/1,2/2) -- F{ (y lt y^. 
But the system (9) then reduces to 

F((y(,y 2 ) = F 1 (y 1 ,y 2 ), Fl(y lt y 2 ) = F" 1 (y l ,y t ). 

That is, F{ and F [ are absolute invariants of the group of rotations about the 
origin, and hence must be of the form 

(12) F( = R 1 (r a ), F [ = R,(f-), 
where 

r~ = y\ + yl. 

The necessary conditions (8), (11), (12) that -P\ ( 2/1 , 2/2 ) , ^2 ( 2/1 , 2/2 ) be 
solutions of (7) are shown sufficient by direct substitution. 

The re-presentations X - Y of z invariant under the group of motions in the 
plane are 

Z 1 = ar 1 + 2/lJ R 1 (r 2 ) + 7/ 2 ^ 2 (r 2 ), Y, = Xl - y, R, (r 2 ) - y t R 2 (r 2 ) , 
^2 = xi - yi #2 (r 2 ) + 2/ 2 t (r 2 ) , F 2 = .r 2 + ^ /? 2 (r 2 ) - y z R, (r 2 ) , 

r 2 = ?/? + y\, and RI, R* are real functions of r 2 , analytic for all finite 
values and not vanishing simultaneously except perhaps when r 2 = . 

C. n i? 3 . In case n = 3 , the functions FI , F 2 , F s in the representation 

Xi = Xi + Fi (yi, 2/2,2/3), Yi = Xi + Fi(- y lt - y z , - 2/3) 

(*=1,2,3) 
are to be determined from the system of equations 

(14) F i ( y (, y 2 ,y z } = j^auFj (2/1, 2/2, 2/3) (i=l,2,3), 

;=i 

where 

3 

y i = I&gt;;;2/y (* = 1,2,3), 

3=1 

and the determinant | a^ \ is orthogonal and has the value + 1 . 

In case of an arbitrary rotation about the axis of z 3 , 2/3 = 2/s and y\ , y z are 
transformed as for n = 2 . The last equation of (14) reduces to 

^3 (y i, 2/2, 2/3) = F s (yi, ?/2, 2/s), 
and the first two to the equations (7), with the variable y z added in each 
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function. Consequently, 



y, Fl (y\ + yl, 
(15) F, -- -yiFl(yl + yl 



Corresponding respectively to the rotations about the axes of 21 and zi, 
there are two other sets of equations like (15) and obtainable from (15) by 
permuting the subscripts 1,2,3 cyclicly. From the three sets of equations 
it follows that F [ == Fl = F i = 0; for example, since/ 1 ! == F{" (yl + yl,yi}, 
F\ is an even function of t/ 2 , whence, by the first equation of (15), F z = 0. 
Therefore 

Fi = Vi F j (y* i +yl,y i } = y t F , (y? + y], y,), 

where i , j , k are to be permuted cyclicly through the values 1,2,3. Thus 

F{ --F 2 = F 3 = R(r 2 ), where r 2 = y\ + yl + yl, 
and (13) becomes 

X i = x i + y i R(r s ), Y { = x&gt; - y&lt; R(r*) (i==l,2,3). 

Substitution shows that the functions Fi in this representation satisfy (14). 

This result for n = 3 can be generalized by mathematical induction to 
hold for the case n &gt; 3 . 

The representations X Y of z invariant under the group of motions in n- 
dimensional space, where n ^ 3 , are 

IVC Xi = Xi + yiR(r s ), Y^Xi-y.R^-) (i = 1 ,2 , , n) , 

where 

r z = yl + yl+ + yf,, 

and R ( r z ) is a real function analytic for all finite values and not zero except perhaps 
when r 2 = . 

For the group of motions and reflections the result incorporated in IVC holds 
for all values of n. To establish the truth of this statement for n is 3 , it is 
sufficient to show that the representation IVC is invariant under a reflection 
in one of the coordinate hyperplanes. If n == 2, the representation must be 
of the form IVB and be invariant under z( = z\,z 2 = z 2 ; then R 2 (r 2 ) =0, 
and conversely. Finally, IVA is invariant under zj = zi if and only if 
$ ( yi ) is an even function of yi , i.e., is of the form R ( yl ) . 

7. The group of direct transformations of similarity. Necessary and suffi 
cient that the representation X -&gt; Y be unchanged by the direct transforma 
tions of similarity in the cases n, = l,n = 2,nis3is the requirement that 
IVA, IVB, IVC be respectively invariant under an arbitrary stretching from 
the origin of positive ratio. According to 4, the arbitrary functions involved 
in the representations IV are, then, all constants, and conversely. 
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The representations X &gt; Y of z invariant under the group of direct transforma 
tions of similarity are 



k ^ , if n = 1 , 

Xi = o-i + Icy i + Zi/z , Y l = Xi- kyi - ly z , 
VB X z = Xz - lyi + kyz , Y z = x z + lyi ky z , 

A^ + P^o, if n = 2, 
Xf == Xi + ky f , Yi == Xi - ky i} 

k ^ 0, i == 1,2, ,n, if n ^ 3. 

Except in the plane, these representations are identical with the representa 
tions III invariant under the affine group. 

For the group of direct and indirect transformations of similarity the representa 
tions are for all n given by VC and are identical with those peculiar to the affine 
group. 

The two inverse representations VB, for which k = and I = =b 1 , namely 

VT Xi = xi T/o , 3 i = .TI T y z , 

Xz = Xz =F 7/1 , Y z = .T 2 ?/l , 

are &lt;^e representations of Laguerre. From their geometrical definition ( 1), 
it is evident that these representations are invariant under the direct circular 
transformations. It can be shown that they are the only representations with 
this property. 

8. Representations inverted by a set of transformations. Given a group, 
G, of direct transformations of a certain type and the set, S, of indirect 
transformations of the same type, to determine, among the representations 
invariant under G, those which, though not invariant under S, are merely 
inverted by the transformations of S . 

It is geometrically evident that a Laguerre representation is inverted by an 
indirect circular transformation. Of the groups G other than the circular 
group, every one under which invariant representations exist is linear and 
contains the group of translations. Hence the representation must be of the 
form I. In order that it be inverted by a linear transformation (2), it is 
necessary and sufficient that the functions Fi satisfy the system of equations 

Fi (- y[, - y 2 , , - y n} 

(16) = ^(F^y^yz, , y n } , , F n ( yi ,y 2 , -,*/)) 

(i= 1,2, ,n), 

In 4-7 it has been shown that, when G is any one of the following groups 
of direct transformations: homothetic group, affine group, group of motions 
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(n = 3), similarity group (n ^ 2), all the representations invariant under 
G are also invariant under S . It is true also that no one of the representations 
IVA, invariant under the group of motions for n - - 1 , is inverted by any 
transformation of S, as can readily be proved by applying (16) with z( - - zi 
as the transformation used; but in this case, not all the representations are 
invariant under S (6). 

If G is the group of motions (or of direct transformations of similarity) in 
the plane, the functions Fi to be used in (16) are as in IVB (or VB). It is 
sufficient to choose as the transformation a reflection in a coordinate axis. 
Then equations (16) reduce to R\ == (or k = 0) . 

Of the representations IVB invariant under the group of motion* in the plane, 
only those for which R 2 = are invariant under reflections, irhrrcas only those 
for which RI == are inverted by every reflection. 

Of the representations VB invariant under the direct transformations of 
similarity, only those for which I = are invariant under the indirect trans 
formations of similarity, whereas only those for which k = are inverted by every 
such transformation. 

HARVARD UNIVERSITY, 
CAMBRIDGE, MASS. 



NON-LOXODROMIC SUBSTITUTIONS AND GROUPS 
INN DIMENSIONS* 

BY 

EDWARD B. VAN VLECK 
Linear substitutions 

(1) * - ^ 

ex + a 

are commonly divided into two classes, loxodromic and non-loxodromic, and 
the non-loxodromic substitutions are further classified as elliptic, parabolic, 
and hyperbolic. So far as I am aware, no corresponding classification of 
projective substitutions in n -- 1 nonhomogeneous or n homogeneous variables 
has been suggested.! The classification which is introduced below is based 
upon the formulation of a projective substitution under the shape of a homo 
geneous substitution with determinant + 1 . If the substitution (1) is 
written in homogeneous form 

x( = ax\ + bx , x 2 = cx + dx 2 

with unit determinant, the substitution is non-loxodromic when, and only 
when, a + d is real; that is, when the characteristic equation 



a \ b 
c d \ 



_ ~ 



is a real equation. The substitution is then hyperbolic, parabolic, or elliptic J 
according as the two roots of this equation are real and distinct, coincident, or 
imaginary. Correspondingly, the projective substitutions are taken ex 
clusively in this paper in homogeneous form, 

X{ = On Xl + dizXi + + ln %n , 

(2) 

x n = a ni xi + a nZ x* + + a nn x n , 



* Presented to the Society, March 25, 1921. 

t Fubini in his Introdusione alia Teoria dei Gruppi discontinui e delle Funzioni automorfe, 
pp. 77-8, has applied the above terms to those substitutions which are movements of 

sg + (*f+a+y+sj_i)-l ( = D, 

into itself. His aims and distinctions are very different from those of this paper. 

t An application of the terms elliptic and hyperbolic differing slightly from the above usage 
of Klein is also to be found in the literature. Cf. Veblen s Projective Geomeiry, vol. 2, pp. 5, 

171, 248. 
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with determinant A equal to + 1 . I call a substitution non-loxodromic, if, when 
so written, it possesses a real characteristic equation, 



/ON 



X 012 a i n 

d22 X 02n 



The characteristic roots are accordingly real or pairs of conjugate imaginaries. 
It is well known that there are just two classes of groups (1) in a single 
variable which consist of non-loxodromic substitutions without a common 
pole. Either all the substitutions of the group leave invariant a circle in the 
complex .r-plane and the group is called a "group with a principal circle" 
(Hauptkreis Gruppe), or the group consists entirely of elliptic substitutions. 
In the former case the group can be so transformed that the circle becomes the 
real axis. We have then a canonical form for the group in which its substitu 
tions have real coefficients and unit-determinant, and, conversely, any group 
of this character is non-loxodromic. A second canonical form is 

(3 ) x 



, 

( c di ) x + ( a oi ) 

which leaves the unit circle invariant. The second class of non-loxodromic 
groups the elliptic groups can be put into a canonical form in which the 
substitutions are all of form 

(4) af^^^++L (* 

(c di)x 



and the group can be interpreted as a group of rotations of a sphere into itself. 
When the number of homogeneous variables is greater than 2, there are, 
of course, more varieties of non-loxodromic substitutions to be distinguished 
than the familiar hyperbolic, elliptic and parabolic types, and it was perhaps 
scarcely to be expected that the result summarized in the preceding paragraph 
would have its exact analogue. Nevertheless an exactly similar result is 
obtained. There are still just two types of non-loxodromic groups. A 
limitation upon this statement (perhaps unnecessary) is introduced by the 
proof which requires that the non-loxodromic group should contain at least 
one substitution having a characteristic equation with n distinct roots. Corre 
sponding to the hypothesis of the preceding paragraph that the substitutions 
(1) should not have a common pole, we have now the requirement that our 
non-loxodromic group of substitutions (2) in n homogeneous variables should 
not leave invariant a linear space of less than n -- 1 (complex) dimensions; in 
other words, should not transform into itself a hyperplane nor a space common 
to two or more hyperplanes in our projective space S n -i 
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One normal form obtained for the first class of non-loxodromic groups is that 
in which all substitutions (2) of the group have real coefficients and a deter 
minant equal to + 1 . In this normal form the group obviously leaves in 
variant the real space embedded in the complex space S n -i of the n homo 
geneous complex coordinates which is of 2n 2 real dimensions. Any pro- 
jective (n -- 1 )-dimensional image of this real space within the complex space 
has been called a "chain" (von Staudt, Fieri), and every non-loxodromic group 
of our first class accordingly leaves some chain invariant. If n -- 1 is even, 
all projective substitutions which leave a chain invariant form a non- 
loxodromic group of the first class. When n -- 1 is odd, this is not the case. 
The most inclusive non-loxodromic group which leaves the chain invariant is 
then a subgroup of the set of all substitutions (2) which leave the chain 
invariant.* 

The second class of non-loxodromic groups is novel and interesting. It 
occurs only when n is an even number 2m . Any group of this class can be so 
transformed that all its substitutions have simultaneously the canonical form 

Xzi-l = (2i-l, 1 Xl + dzi-l, 2 o) 

+ + (2i-l, 2m- 1 Xzm-l + dzi-l, 2m %2m ) , 

(5) 

%2i ~- ( ~ a 2i 1, 2 -I l + &2i 1, 1 #2 ) 

+ + ( O.zi-1, 2m X-Zml + Q-Zi1, 2m-l % 



where a,-,- denotes the conjugate of a;y. Thus the determinant of the sub 
stitution consists of m 2 blocks of form 



(6) 



(odd i and j ) , 

3+1 



similar to the block of coefficients of (4). It is shown that when a determinant 
has this structure, it is necessarily positive or zero.| 

For any (non-singular) substitution of the second class the_ elementary 
divisors of (3) occur only in conjugate pairs (X Xi)% (X Xi) s of equal 
degree, this statement applying as well for real roots X; which furnish pairs of 
equal elementary divisors. Conversely, a non-singular substitution which 
fulfills this requirement is one of the second class and may be transformed into 
the normal form (5). Such substitutions will be called elliptic (restrictedly 
elliptic, when the characteristic roots are m distinct pairs of conjugate imag- 



* J. W. Young in these Transactions, vol. 11 (1910), p. 280, investigates the neces 
sary and sufficient condition that a collineation ( n = 3 ) shall leave a two-dimensional chain 
invariant. 

t This result has been known for the special case in which all the elements of the deter 
minant are real. Cf. Kowalewski, Einfuhrung in die Determinantentheorie, pp. 156-158. 
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inaries). The characteristic property of an elliptic substitution in its normal 
form (5) is that it converts any pair of points 

P = (.TI , .r 2 ; ; x zm -i , zim ) , Q = ( xz, xi; ;-- x 2m , x 2m -i ) 

into another pair of points whose coordinates are similarly related, any sub 
stitution having this property being necessarily of form (5). The invariant 
points of any substitution (5) occur in such pairs; in particular, the 2m in 
variant points of a restrictedly elliptic substitution, taken in normal form, 
consist of m distinct pairs. 

It will be noticed that in (5) the 2m equations of the substitution are associ 
ated in pairs. In its normal form the most general group of the second or 
elliptic type comprises all non-singular elliptic substitutions (5) with the same 
pairing of the coordinates. 

1. DERIVATION OF TWO STANDARD FORMS FOR NON-LOXODROMIC GROUPS 

As remarked in the introduction, we will suppose our projective substitutions 
to be written throughout in homogeneous form (2) with determinant 

an a 12 

(7) As 

equal to + 1 . Obviously the necessary and sufficient condition that a sub 
stitution (2) should be non-loxodromic by the above definition is that the sum 
of all the principal minors of each order r S n should be real. 

When the roots of the characteristic equation (3) are all distinct, the sub 
stitution can be transformed through an appropriate substitution T into the 
canonical form 

(8) x t =\&lt;Xi (t=l,-..,n) (5,). 

We will treat only groups of non-loxodromic substitutions which contain at 
least one such substitution Si, and we will suppose that the group has been 
so transformed that Si has the form (8). Further, it will be assumed that the 
subscripts i have been so assigned in (8) that for i = 1 , 2 , , 2m ^ n the 
Xi are successively m pairs of conjugate imaginaries, while the remaining Xj 
are real. 

Consider now any non-loxodromic substitution S , given by (2), which com 
bines with Si so as to generate a non-loxodromic group. The sum of principal 
minors flu + 022 + + a-nn is real, likewise the corresponding sum 

X* an + X* a 22 + + X a nn = r k 

for SSi . Denoting these real sums by rk and taking fc = , 1 , , n 1 , 
we have a system of n equations linear in the an. The determinant of the 
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system is the product II (X; X&gt;) which cannot vanish in consequence of our 
hypothesis of the distinctness of the roots of Si . There is therefore a unique 
solution for the a fi in terms of the X; , n . The first 2m coefficients A? in each 
equation are successively pairs of conjugate imaginaries while the remaining 
ones are real. If, therefore, we exchange each of the first m odd terms on the 
left side of each equation with the following even term, we have a system of 
equations which is the conjugate of the preceding system, and a;+i, ;+i 
being interchanged for odd i &lt; 2m . Consequently an and a i+ i, i+ i are con 
jugate imaginaries for odd values of i &lt; 2m, while the an are real for i &gt; 2m. 
Consider next the combination of (2) with any substitution 

(9) x i = bn xi + b&gt; 2 a-a + + b in x n ( S ) 

of the same non-Ioxodromic group. The product S S is a substitution 

x i = Bn xi + B i2 x 2 + + B in x n (i= 1, , n) , 
in which 

(10) BH = bn an + b iz a Z j + + b in a nj . 

Suppose first m = 1 . Then B\\ , B^ are conjugate imaginaries, and the same 
is true of the corresponding coefficients 

-B n = Xl 6ll Oil + AS &12 21 + + X* bin Ctnl , 
BZZ = Xi 621 Ql2 ~h X 2 "22 22 ~T" ~T" X re bo n dnl 

in S S*S . Take k = 0,l,---,n--l. We have then in (11) two systems of 
equations linear in the bn an and 2 i a & respectively with a common deter 
minant II (Xi Xy) not zero. In the two systems corresponding coefficients 
X* are the same, the first m pairs being conjugate imaginaries and the 
remaining ones real, while the corresponding left members are conjugate 
imaginaries. We may therefore regard the two systems of equations as con 
jugate imaginary systems in which the bn an and 6 2l ; a i2 correspond to one 
another for i &gt; 2m, while for odd i &lt; 2m they correspond respectively to 
bz, i+\ di+i, 2 and &i, ,-+i a; + i, i respectively. The solution of the two systems 
therefore gives 

(12) bn an = b 2 , i+i a,- + i, 2 (odd i &lt; 2m) , 

(13) &2i a; 2 = bi, i+i cii+i, i (odd i &lt; 2m), 

(14) biitta = btittiz (i &gt; 2m). 

We will now make the Fundamental Hypothesis that for each value of 
i =i n there is in the non-loxodromic group at least one substitution (2) for 
which an is not zero and also one for which an is not, leaving for later con- 
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sideration the geometrical significance of this hypothesis. Choose then two 
substitutions S, S for which 621 i2 7^ 0. Then by (13) 12 and a 2 i are not 
zero. Taking next S identical with S , we see by (13) that ai 2 021 is its own 
conjugate and hence is real. A very different outcome is obtained according 
as we suppose that this product is positive or negative. The two corresponding 
cases we will refer to as case 1 and case 2 respectively. 

To normalize the groups in these two cases it will suffice to make a simple 
change of variable 
(15) x i = p i y i , x i = piy i (i = 1 , , n) 

with a proper determination of the pi, restoring for convenience the letter x 
in place of y after the transformation. This change does not affect our 
canonical substitution S\ nor the values of the an in 5 , while the new value of a^ 
differs from the old value by the factor pj/pt . Also the values of the principal 
minors of (7) and the characteristic equation (3) remain unaltered. We will 
so choose pi , p 2 that in the substitution S of the preceding paragraph, for which 
i2 0-11^ , we will have a J2 = a 2 i in case 1 and a u = - a 2 i in case 2. It 
follows then from (13) that we have simultaneously b^ = 21 in case 1, 
&i2 = - 621 in case 2 for every substitution S of the group. 

Next take any odd value of i between 1 and 2m and choose two substitutions 
S , S for which b^an is not zero. Then in consequence of (12) a l+ i, 2 and 
Oz, i+i are not zero. By properly choosing p 1+ i/p; we may make a,i and 
a; + i, 2 in S the conjugates of one another. If then S denotes any substitution 
whatsoever in the group (including S itself), it follows from (12) that for all 
substitutions S we have simultaneously 61 = 6 2 , ,-+iforoddi &lt; 2m. Taking 
an S for which bn 3^ 0, we see again from (12) that for every substitution S 
of the group we have an = a,-+i, 2 for odd i &lt; 2m . 

Consider next the two coefficients in S S*S 

#12 = Xi b n i2 + Ag 612 ff, 2 + + A* b ln c n2 , 

B ( 21 -= Aj &21 Oil + Al 022 021 + + X* b 2n Clni . 

These are conjugate imaginaries in case 1 and negative conjugates in case 2. 
If we set k = 0, 1 , , n -- 1 , we have two systems of equations linear in 
&ii o i2 and b Z i an respectively. Reasoning in exactly the same manner as 
above for equations (11), we obtain the following relations between any two 
substitutions of our group: 



(16) " iz 2 +l i+1 1 (odd i &lt; 2m ) 

t&gt;naa =-- D 2 , i + ia i+ i, i (case 2) 



(17) 2i ail ~- = l i +1 i+l 2 (odd i &lt; 2m 
o-ii an = PI. i+i flf+i. 2 (case 2) 

(18) 6w -&*_ (easel) 
OiiOj-2 = bzittii (case 2) 
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For odd i &lt; 2m our normalization has already made. a,i = a+i, 2, 
bu = 6 2 , 1+1. If, therefore, we select for (17) a substitution S in which the 
former pair is not zero and for (16) a substitution S in which the latter pair 
is not, we get from these equations 6 2i = bi, ;+i, a i2 = a f+1&gt; i in case 1, and 
&2i = -- bi, i+i, a t - 2 = a,+i, i in case 2. These results apply to any sub- 
stitutipns S , S in our group. Lastly, if we take a pair of substitutions with 
bu an ^ and divide (18) by (14), we get 

(19) Oi2 = a i i (i &gt; 2m, case 1), 

an (lit 

while in case 2 these two ratios must be the negative of each other. But the 
latter is impossible, since a quantity cannot equal the negative of the conjugate 
of its reciprocal. We thus reach the important conclusion that case 2 is possible 
only when i = 2 in . 

In case 1 equation (19) makes an a a . We have not yet disposed of 
Pi for i &gt; 2m, and we will now so take it that the arguments of an and a i2 
in the particular S under consideration shall be the negative of each other. 
Then we have an = a^, whence it follows by (18) that in case 1 we have 
b&lt;2i = bu (i &gt; 2m) for every substitution S in the group. By taking next an 
S with bu 7^- 0, it follows again from (IS) that for every substitution S in the 
group we have a a = dn (i &gt; 2m) in case 1. 

We have thus completely determined the character of the first two rows and 
columns of the coefficients (7) of every substitution in our non-loxodromic 
group. The character of the remaining rows and columns may now be rapidly 
fixed as follows. We know that for odd j &lt; 2m the coefficients 

B ( i) = X* fen au + A fei2 a 2j + + \n bj. n a nj , 
(20) 

B ( 2, 3+1 = ^1 bzi Ol, j+l + X 2 feo2 2, j+l + + X n " b 2n 0-n, j+1 

in S SiS are conjugate imaginary in both cases 1 and 2. Putting k = 0, 1, 
, n -- 1 , we have two systems of equations for the n quantities bu a,y and 
bzidi, j+i (i = 1, , n) respectively. Since X*, X* + i are conjugate imag- 
inaries for odd i &lt; 2m and X t - is real for i &gt; 2m , it follows in the same manner 
as before that we have the following pairs of conjugate imaginaries: 

fen- an and 6 2 , ; +1 a i+l , j+l 



&lt; 

zi di, y+i and 61, ;+i a i+ i, j 
bu an and b 2 i o,-, y+i (odd j &lt; 2m and i &gt; 2m) . 

Now for odd i &lt; 2m we had already bu = 6 2 , i+i in both cases 1 and 2, 
bi, 1+1 = b~ 2 i in case 1 and &i, ,+i = - 6 2 i in case 2; while for i &gt; 2m we had 
bu = bzt in case 1. Since also by our Fundamental Hypothesis we could select 
S so that a prescribed one of these quantities should not be zero, it follows 
that for every substitution S of our group we have the relations: 

Trans. Am. Math. Soc. 18. 
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. Case 1. 

= ai+1 m (odd j &lt; 2m; odd i &lt; 2m) 

Q&gt;i, j+l Oj +l, j 

dij = a;, y+i (odd j &lt; 2m; odd i &gt; 2m) 

Case 2. 

ttij = O,+i, y+i . 

(odd j and i ) 

&lt;3;, j+l = Qi+1, y + i 

The character of the coefficients has just been completely fixed for case 2 
and for the first 2m columns of (7) in case 1. In case 2 the determinant (7) 
consists of m 2 blocks of form (6), as stated in the introduction. 

It remains only to fix in case 1 the law of the coefficients subsequent to the 
2mth column of (7). Now it has already been found that B fj and Biff are 
conjugates for j &gt; 2m , and their values will be given by (20) if the subscript 
j + l in the second equation is replaced throughout by j. Taking then 
k = 0, 1 , , n -- 1 in the two equations of (20) and arguing in the same 
manner as before, we obtain for odd i &lt; 2m the conjugate imaginary pairs 
b\{ aij and 62, i+i a+i, j, b&lt;n a^j and 61, ,- + i o,-+i, j, while for i &gt; 2m we have as 
conjugates buOij and 62; ;.,. For the given odd i &lt; m a substitution S 
can be employed for which the conjugate elements bn, b 2 , t -+i or bo i; bi, z - +1 
are not zero, and for i &gt; 2m a substitution for which the conjugates bn, 6 2 i 
are not. Hence, in every substitution S of our group, a f j and a l+ i, } - are 
conjugates for odd i &lt; 2m, j &gt; 2m, while for i &gt; 2m, j &gt; 2???. every a f j is 
self-conjugate and consequently real. 

The completed result which has thus been reached in case 1 may now be 
summed up as follows. On the assumption that Si possesses distinct charac 
teristic roots \i and at least one pair of roots which are conjugate imaginaries, 
a normal form for a non-loxodromic group satisfying our Fundamental Hypoth 
esis has been obtained in which every substitution has the following structure. 
Its determinant (7) consists of four sections. The minor common to the first 
2m rows and columns is made up of m 2 blocks of form 

/ i i i \ 
(odd i and j ) , 

j+\ ttij 

and the minor common to the last n 2m rows and columns contains only 
real elements. For i &lt; 2m, j &gt; 2m the elements occur in conjugate pairs 
an, a i+ i, j (odd i) , and for i &gt; 2m, j &lt; 2m in conjugate pairs aij, a it 7 - +1 
(odd j ) . This normal form corresponds to the form (3 ) given in the intro 
duction for m = 1 . The simple change of variable 

(i &lt; m), 
(i &gt; 2m) 
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throws the group at once into a second and simpler normal form in which the 
coefficients are all real. 

When m = 0, the above derivation of the group fails, but the real normal 
form just indicated is still valid and can be rapidly derived as follows. By 
hypothesis, all the characteristic roots X; of Si are now real as well as distinct, 
and accordingly in every substitution (2) of the group all the an are real. 
Since the left members of the system obtained by putting k = 0,1, ,n 1 
in the first equation of (11) are real, all the 6 lt - a a resulting from the solution 
of the system must be real. Further, by our Fundamental Hypothesis, for 
each value of i there is some S for which fe lt - is not zero, and in this particular 
S by a change of variable (15) with the proper value of p;/pi we may make 
bn (ij^ 1) real. It follows then that in every substitution S of the group 
an is real. By choosing next an S having a,i ^ , it follows again from the 
reality of bnan that every substitution S of the group has a real &H. In 
this wise all the elements of the first row and column of (7) may be made 
simultaneously real for all substitutions of the group. Turning next to the 
system of equations obtained by putting k --= 0, 1, , n -- 1 in the first 
equation of (20), we argue in the same manner that every bn a^ is real and 
hence every a^- . Thus by the simple change of variable (15) we can throw 
the group into a normal form in which all of its substitutions are real. 

We turn now to consider the significance of our Fundamental Hypothesis 
that for each value of i there is some substitution of the group for which 
O-HT^ 0. Make for a moment the contrary hypothesis and suppose that 
for some fixed j we have a\j = in every substitution of the group.* Then in 
(20) we have B ( $ = for this value of j . Taking k = 0, 1 , , n - 1 , 
we have a homogeneous system of equations giving bn a ,7 = (i = 1 , ,n) 
for any two substitutions S , S of the group. Hence for any value of i we 
must have either &H = for every substitution S of the group or an = 
for every substitution S of the group. Suppose now that there are exactly 
r values of j for which a-ij is zero in every substitution of the group. Then for 
each of these r values of j and for any other value of i not one of these r values 
we must have a,y = in every substitution of the group. There will be no 
loss of generality if we assume that in the last r columns of (7) and in no others 
we have a\j = , since this can be attained f by merely assigning the subscripts 
properly to our n variables a; . Then all the elements of (7) common to these 
columns and to the first n r rows will be zero. Consequently the first 
n r equations of (2) contain only the variables x\ , , .r n _ r . This reveals 
that the space of r 1 (complex) dimensions (or a point, if r = 1 ) obtained 

*If some of the a^ are zero for every substitution of the group, a parallel reasoning applies 
with like result. 

t Unless au = for every substitution, when we use the first and the last r 1 columns 
in place of the last r columns. 
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by putting these coordinates equal to zero is carried over into itself by all 
substitutions of the group. Conversely, it is evident that if this is the case, 
the elements of (7) common to the last r columns and the first n r rows 
must be zero for all substitutions of the group. The significance of our 
Fundamental Hypothesis is therefore to forbid that all substitutions of the 
group should have such an invariant space when Si is taken in form (8). 

The hyperplanes a;,- = are the only invariant hyperplanes of Si since the 
multipliers A; in (8) are by hypothesis all distinct. The linear spaces common 
to n r of these invariant hyperplanes are also invariant under Si , and it 
is readily seen that they are the only spaces of r -- 1 dimensions invariant 
under Si which are common to n r hyperplanes. For consider an invariant 
space which is defined by linearly independent hyperplanes 

n 

Ui=Y^ C H *J = (i=l, ,n- r ) . 

&gt;=i 

If we take these planes as our coordinate planes along with r other independent 
hyperplanes, then Si in terms of the new ^-coordinates will be specified by n 
equations of form (2) with the Ui in place of the Xi. In the corresponding 
determinant (7) the elements common to the last r columns and the first 
n r rows must be zero since the space common to the hyperplanes u f = 
(i = 1, , n r) is invariant. Accordingly there will be then ?i r 
linearly independent combinations of u\ , , u n - r corresponding to n r 
characteristic roots X t , and these, except for constant factors, must be identical 
with n r coordinates a;,- which figure in (S). Hence the invariant space 
common to the hyperplanes Ui = ( i = 1 , , n r ) can also be specified 
by putting n r coordinates Xi equal to zero. We thus see that our Funda 
mental Hypothesis is identical ivith the requirement that the substitutions of our 
group should not admit a common invariant hyperplane nor an invariant space 
common to two or more hyperplanes. This requirement is, of course, unaltered 
when Si with the group is transformed out of its normal form. It corresponds 
to the condition mentioned in the introduction, that a non-loxodromic group 
of substitutions (1) should not have a common pole. 

Up to the present point the development has involved only the reality of 
the characteristic equations (3) and in no wise the signs of the determinants 
(7). The conclusions reached therefore apply not only to groups of non- 
loxodromic substitutions but likewise to non-singular groups of substitutions 
whose characteristic equations are real. For the latter also, under the limitations 
specified, there are just two classes of groups, and these can be transformed 
into the two normal forms which have been given. Further, the totality of 
substitutions of either form, with or without the restriction of being non- 
loxodromic, form a group. This is evident in the case of substitutions of the 
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first kind since the totality of real, non-singular substitutions (2) form a group, 
and the same holds true if the requirement is added that the determinants of 
the substitutions shall be positive or that they shall be of value + 1 . In 
the next section it will be shown that the totality of non-singular substitutions 
of the second kind also form a group, in this case necessarily non-loxodromic. 

2. ON THE NATURE OF THE TWO CLASSES OF NON-LOXODROMIC GROUPS 

A real protective substitution (2) of determinant A can be reduced to a form 
having the determinant unity by dividing each coefficient in (2)byA 1/n . If 
A is positive, a real nth root may always be chosen so that the substitution 
remains in real form, but if A is negative, this is only possible when n is odd. A 
real projective substitution in space of n -- 1 (real or complex) dimensions is 
therefore necessarily non-loxodromic when the number of dimensions is even, 
but when it is odd, only if the determinant of the substitution is positive. 

In accordance with 1 the general non-loxodromic group of the first class, 
expressed in normal form, consists of all real substitutions (2) of determinant 1 . 
Hence it comprises all non-loxodromic substitutions which leave invariant 
the real space of the n homogeneous variables which lies within the complex 
S n -i When n - - 1 is even, all real and non-singular projective substitutions 
of the n variables are included. If the group by transformation through any 
substitution is thrown out of normal form, the real space is converted into an 
invariant "chain" for the transformed group. 

For the special case n = 2 it is known that the poles of all hyperbolic and 
parabolic substitutions in a non-loxodromic group of the first class lie upon the 
invariant real axis or circle, while the poles of any elliptic substitution of the 
group lie symmetrically with respect to the real axis or circle. We can easily 
see that there is a corresponding generalization for greater values of n . Con 
sider, for example, any non-loxodromic group of the first class which contains 
a substitution S whose characteristic roots are all real and distinct. Such a 
substitution will be called hyperbolic since it is the natural generalization of the 
hyperbolic substitution for n = 2.* Let the group be transformed into its 
normal real form. Then the hyperbolic substitution becomes a real sub 
stitution (2) of unit determinant. To each of our n distinct characteristic 
roots A; belongs a linear form, 

Yi = Ci Xi + C 2 Xz + + C n X n , 

which undergoes the substitution 7 == X t Yi in consequence of S . The well 
known equations for determining the coefficients c; of the form, given in (23) 
below, show that the ratios of these coefficients are all real. If we equate 

* It will presently appear that no hyperbolic substitution is included in a non-loxodromic 
group of our second class. 
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these n forms to zero, we obtain a configuration of n real and distinct hyper- 
planes, and the n vertices of this configuration are the poles of the hyperbolic 
substitution. The poles of the substitution are accordingly all real, a con 
clusion which applies simultaneously to all hyperbolic substitutions of the 
group. Hence in a non-loxodromic substitution of the first class the poles of its 
hyperbolic substitutions all lie in the invariant chain of the group. 

Conversely, if we have a set of hyperbolic substitutions whose poles lie in a 
common chain, they generate a non-loxodromic group. For let the group be so 
transformed that the chain becomes the real coordinate space. The poles of 
any hyperbolic substitutions in the group are now real and by a real trans 
formation of the group may be made to be the n-coordinate vertices. The 
substitution then has the real form .? = \i Xi (i -- 1 , , H ) and hence must 
have had also a real form before the last transformation. Consequently all 
hyperbolic substitutions of the group have been made simultaneously real. 
Now by hypothesis our hyperbolic substitutions are of unit-determinant; 
they generate therefore a group consisting only of real substitutions of unit- 
determinant, that is, a group of non-loxodromic substitutions. 

Take now the other extreme case in which the characteristic roots of some 
particular substitution of a non-loxodromic group of the first class are distinct 
pairs of conjugate imaginaries. Such a substitution will be called restricted!;/ 
elliptic and is possible only when n is an even number 2m . By transformation 
of the group the particular substitution can be thrown into the form (8) and be 
used as our substitution Si of the preceding section. Without change of Si 
the group was there brought into a first normal form and then was converted 
into the final real form by the transformation (21). Since the characteristic 
roots of our substitution are by hypothesis all distinct, the invariant points of 
(8) w r ere the ^-coordinate vertices. The transformation (21) converts the 
invariant point for which x Z i~i is the only non-zero coordinate into a point for 
which the ^/-coordinates are all zero except 2/22-1 = X 2i-i, yi i = - .I LV-I, 
while the invariant point having x-a as its only non-zero coordinate is con 
verted into one for which all the ^-coordinates are zero except 7/ 2 i-i = .TO i , 
2/2 i = ixzi- These two points are imaginary points which lie symmetrically 
with respect to the real space of the ^-coordinates; i.e., the space for which the 
ratios of the 2m homogeneous coordinates are real.* Thus the 2m invariant 

* If we think of a complex space Sk as a real space with 2fc real orthogonal axes, two points 
lie symmetrically with respect to the linear space determined by r-coordinate axes if the corre 
sponding r-coordinates of the two points agree, while the remaining coordinates of one point 
differ only in sign from those of the other. Accordingly, in the case before us, if we interpret 
the n complex coordinates yi as nonhomogeneous coordinates for a real space of 2 dimen 
sions, the n = 2m invariant points of our substitution are obviously m pairs of points sym 
metrically situated with respect to the linear space of n dimensions which consists of the points 
for which the yi are all real. When, however, as above, the coordinates are thought of as 
projective, some one of the yi equated to zero let us say y n = specifies the region at 
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points of the substitution Si in its final real form are m pairs of points sym 
metrically situated with respect to this real space of 2m 1 dimensions 
embedded in the complex space of the coordinates having double this number of 
dimensions. This real space is the invariant real space of the normalized 
real group in which our substitution is contained. The other " restrictedly 
elliptic" substitutions of the group, whose characteristic roots are distinct 
pairs of conjugate imaginaries, have become real simultaneously with Si . 
Now any two real forms into which a given substitution can be transformed 
are convertible the one into the other by a real linear change of coordinate 
variables. Such a change in our ^/-coordinates converts a pair of points, 
symmetrically situated with respect to the real space of the coordinate-ratios 
and having corresponding coordinates the conjugates of one another, into 
another like pair of coordinates. Since, furthermore, any restrictedly- 
elliptic substitution of the group could be made our Si, it follows that the 
invariant points of all restrictedly elliptic substitutions of a normalized non- 
loxodromic group of the first species consist of pairs of points which lie sym 
metrically with respect to the real invariant space of the group. If the group 
is transformed through any substitution T , the real invariant space is con 
verted into a chain, and the images of the invariant point-pairs of these sub 
stitutions are converted into pairs of points which may be said to lie sym 
metrically with respect to this chain. 

Turn now to the groups of the second class which are more novel and there 
fore more interesting. We will first prove that any substitution whose de 
terminant consists of m 2 blocks of form (6) has a real characteristic equation. 
To establish this it is necessary to show that in its determinant (7) the sum of 
all the principal minors of any given order r is real. It suffices to prove that 
the principal minors which are not real occur in conjugate pairs. With this 
in view let us 

(I) exchange each odd row and column of (7) with the following even 
row and column, and 

(II) change first the signs of all the elements in the even rows and then the 
signs of the elements in the even columns of A . 

By these two changes every element of (7) is replaced by its conjugate. 
Since the determinant is unaltered in value by the two changes, it must be 
real, being identical with its conjugate. Any principal minor of (7) formed 

infinity and the nonhomogeneous coordinates will be n 1 ratios y\ly n , , Va-i/Vn The 
difficulty arising from the presence of pairs of points for which y n equals zero may be removed 
by regarding such a pair as the limit of another pair with the same coordinates except y n , 
which is replaced by a real number h approaching zero. If, then, for h ^ the two points 
are symmetrically situated with respect to the real space for which the (w 1) -coordinate 
ratios are real, the limiting points (h = 0) will also be said to lie symmetrically with respect 
to this space. 
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by deleting with each odd row and column the following even row and column 
is real for the same reason. Any other principal minor in consequence of I 
is exchanged with another principal minor, but neither is changed in sign by II. 
Hence these two minors must be the conjugate of one another, and the sub 
stitution has accordingly a real characteristic equation, as stated. 

It will be seen later that a non-zero determinant A having the structure (6) 
must be positive. If therefore the substitution is non-singular, its determinant 
can be made equal to + 1 by dividing all its coefficients by a real \A. All 
non-singular substitutions of form (5) are therefore non-loxodromic. 

Consider next the product of any two (non-singular or singular) substitutions 
S, S of the second class with coefficients a,_,- and bn respectively, and with 
the same pairing of equations in each, as indicated in (5). Since the coefficients 
of both obey the law (6), we find at once from (10) that B i+ i, ,+i == Bij, 
B{ + i t j = - B it j + i for odd i and j . Consequently the product of the two 
substitutions has the same form. 

Furthermore, the inverse of a non-singular substitution will also be of like 
form. In fact, since its determinant (7) is real, it is only necessary to see that 
the determinant of the cofactors AH of the elements of (7) consists of blocks 
of the form (6). Now we have just seen that the changes I and II together 
replace every element of (7) by its conjugate and therefore every AH by its 
conjugate. On the other hand, these changes obviously result in an inter 
change of AH and Af+i, y+i for odd i and j , also of Ai+i, j and At, y+i , with a 
reversal of j&gt;ign in the latter case. Hence we have ^4;+i, y+i = AH, 
Ai+i t j = Ai, j+i. Thus the inverse substitution has the same form, and 
we conclude that any number of non-singular substitutions (5) of our second 
kind generate a non-loxodromic group. 

We now proceed to examine the nature of the substitutions of our second 
class. Let X; be a characteristic root of any substitution (2) of the second 
class. To this there corresponds a linear form 

(22) Yi = ci xi + c 2 z 2 + + c n x n (n=2m), 

which in consequence of the substitution undergoes the transformation 
Y( = X YI, and its coefficients satisfy the equations 

Ci ( On X; ) + C 2 2 i + + C n Oni = , 

c i iz + c 2 (a 22 - Xi) + + c n a n2 = 0, 

ci a\ n + c 2 a&lt;i n + + c n (a nn X;) =0, 

the consistency of which is guaranteed by our characteristic equation. Sup 
pose now every quantity which appears in these equations to be replaced by 
its conjugate. We get a system of equations for GI , , c n . Exchange next 
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each odd equation in (23) with the following even equation, then each odd 
column with the following even column, and finally change the signs of the 
n terms of the resulting even equations. Then by virtue of the fact that 
dij, a; +i , y+i, also ttj+i, j and a;, y +1 are conjugates for odd i and j, we get 
precisely the equations which say that 

(24) - Zi = ( c 2 .TI - ci .TO ) + ( c 4 .T 3 - c 3 .T 4 )++( c n x n -i c n _i x n ) 

undergoes the transformation Z{ == X; Zi in consequence of our substitution 
(2). Thus the pair of linear forms which correspond to a simple pair of 
conjugate imaginary roots of the characteristic equation have the form (22) 
and (24). Moreover, the two forms cannot be linearly dependent even when 
X, is self-conjugate and real, for then we would have 

L &gt; Or Co; Co; + C 2 ;_l Co , _!== (i 1 , , m) , 



which would make every c; = in (22). It follows that every real root of 
the characteristic equation must be at least a double root. 

In case the elementary divisors of the characteristic equation are not all 
simple, there correspond s linear forms to each elementary divisor (X X,-)* 
of the Xi which in consequence of the substitution undergo the transformation 

n = x l r 1 , 

H= Fi + X;F 2 , 

(25) H = Y z +\iY 3 , 

r: = F^ + XiF,. 

Let us suppose that Yi is given by the right member of (22), and correspond 
ingly Zi by (24). Put 

(26) 7 2 = diX! + d 2 x 2 + +d n x n (n=2m). 
By use of (1) and (26) in the second equation of (25) we get 

di(au - X;) + d 2 2i + + d n a n \ = Ci, 
di 012 + d z (a 22 - Xi) + + d n a n2 = c 2 , 

di a ln + d 2 a^ + + d n (a nn X) = c n . 

Replace every quantity which appears in these equations by its conjugate, 
exchange each odd equation with the following even equation, then exchange 
each odd column with the following even column, and finally change the signs 
of the terms on both sides of the even equations. We get a set of equations 
which state that the linear form 



(28) Z 2 = (dzXi d-iXz) + + (d n X n -l 
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undergoes the transformation 

^2 = "1 "f" t ill. 

in consequence of our substitution. Proceeding next in the same way with 
the third equation of (25) and so on in turn with each of its succeeding equa 
tions, we find that to the s linear forms YI , , Y s there correspond also s 
other linear forms Z\ , , Z 3 , which are related to them in exactly the same 
way as (24) to (22) and which undergo the transformation 

Zi = A; Zi , 



AS -~ ^s\ "T At AS &gt; 

in consequence of the substitution. The coexistence of (29) and (25) indicates 
that for each elementary divisor (X A;) s there is another divisor (X X,) s , 
and hence gives the following result : 

THEOREM. The elementary divisors of the characteristic equation of any 
non-singular substitution (5) occur only in conjugate pairs (X A,-) 8 , (A A;) 8 
of equal degree, this conclusion holding also for the real characteristic roots A t - 
to which belong pairs of equal elementary divisors. 

Since the determinant of a substitution is the product of the characteristic 
roots, taken each with its proper multiplicity, the theorem just stated gives 
at once the following important corollary: 

COROLLARY. // a non-singular substitution has the form (5), its determinant 
is positive. 

Determinants with real elements satisfying the condition (6) have been 
previously encountered in some work of Voigt and have been called De 
terminants of Voigt by Kowalewski. The proofs* given to show that Voigt s 
determinant is positive or zero fail altogether to cover the more general 
determinants here considered in which the elements are complex. As the 
proof furnished above was indirect, I shall also give a simple and direct proof. 

The non-negative character of the determinant is obvious when m = 1 
since it is then an an + #12 #12 The extension to any other value of m is 
obtained in the following manner by mathematical induction. Suppose it 
to have been established for any value of m that all determinants of order 
2m 2 having the structure (6) are non-negative. Take then any de- 

* Kowalewski, Einfuhrung in die Determinantentheorie, 69, p. 156. Baltzer s and Drude s 
proofs are both found in the Gottinger Nachrichten, 1887. 

Drude and Kowalewski prove, when the elements of (7) are real, that A is the sum of the 
squares of two expressions. When the elements are complex, the natural extension would be 
to replace each square by the product of the expression and its conjugate. But the resulting 
sum no longer represents A. 
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terminant (7) of order 2m , which obeys the same law, and multiply it by any 
minor A 2m _2 of order 2m 2 obtained by deleting from A any odd row and 
any odd column and the following even row and column. This minor has the 
structure (6), and there will be no loss of generality in supposing that it con 
sist of the first 2m 2 rows and columns of A since the deleted rows and 
columns may be moved to the bottom and to the right-hand border of A without 
violating the law of its formation. By a well known identity* we then have 

(30) A2m-2 A = A-2m-l, 2m-l A^m, 2m ~ ^2m-l, 2m A^m, 2m-l 

Now we have already observed that A 2m -i, 2m-i and Ai m , 2m are conjugates, 
while ^2m-i, 2m and A 2m , i m -\ are negative conjugates. Hence the right 
member of (30) is non-negative, being positive unless all four AH on the 
right of (30) are zero. If, therefore, a single one of the minors A 2m _ 2 formed 
in the manner indicated is positive, it follows that A is positive or zero. On 
the other hand, if every minor A 2 m-2 is zero, every AH by (30) must be zero 
and hence also A itself. Consequently, any determinant consisting of blocks 
of form (6) must be non-negative. Hence the determinant of any non-singular 
substitution of form (5) is positive. 

Any non-singular substitution in n variables which consists only of pairs of 
components of form (25) and (29) is a substitution of our second class since it 
can be thrown into form (5) by alternating the equations of (29) with those of 
(25). Thus, for example, the alternation of their first two equations gives a 

matrix of coefficients 

X 

0X00 

10X0 

1 X 
having the structure (6). 

In establishing the existence of the linear forms Y i , Zi for any non-singular 
substitution of the second class, we implicitly reduced it to canonical form. 
If we regard our m pairs of equations of form (22) and (24) as a change of 
variable, the substitution in terms of the new variables consists of pairs of 
components (25) and (29). Now the transforming equations (22) and (24) 
are of exactly the same form as (5). Hence any substitution of the second 
class can be thrown into canonical form by transformation through an appropri 
ately chosen transformation of the same class. Since all the transformations 
(5) form a group, this can be also done without destroying the form (5) of a 
non-loxodromic group in which it is contained. 

An important property of substitutions (5) of our second class is to convert a 

pair of points 

P = (Xi, X 2 ; , Xz m -l , X Zm ) , Q = ( ~ 



* Muir s Theory of Determinants, 90. 
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into another similar pair. By direct substitution in (5) it will be found at once 
that when the substitution converts P into P = (x^-i , .TO;) , i = 1 , ,m, 
it also converts Q into Q = ( x 2 {, 21-1) A pair of such points we will 
speak of as conjugate with respect to the substitution or group. 

We will now prove, conversely, that if any substitution (2) in our 2m variables 
has the property of carrying every pair of conjugate points into a pair of conjugate 
points, it must be one of our second class having the form (5). Consider the 
first two equations of (2) which, for convenience, we will write 



x 2 = 



+ a- 2 ,r 2 + 
+ bi .TO + 



+ , 
+ b n .r n 



(n= 2m) 



The coefficients of either equation will, in general, be determined by the 
requirement that m pairs of conjugate points P J( Q shall be transformed into 
conjugate pairs P , Q . We get thus the following sets of equations for the 
determination of our coefficients i, &,-: 



.Til = 1 Xn + 2 .Tj 2 + + (t 2m-l Z i, 2m-l + -lm -fi, 2m, 
x il = a\ Xi2 + 2 -Til + -- 2m-l ^i, 2m + 2m Xi, 2 m-l 

( ; = 1 , 



T;2 = 01 Xn + 2 i 

x n = bi xa + 



%i, 1m, 
m ^i, 2m-l 

(i= 1 , 



) , 



m ) . 



The determinants of these two sets of equations are identical and must be 
positive or zero since they are composed of blocks of form (6). To exclude the 
value zero, it will be assumed that m pairs of conjugate points P , Q are taken 
which do not lie in a common hyperplane of our complex S n -i . The numer 
ators of the values obtained for a\ and 6 2 are determinants differing only in 
their first two columns which consist of m blocks 



-x n 



x tl 



respectively, while the corresponding blocks in the numerators of a 2 and &i are 



x n 



The remaining pairs of columns are identical for all four numerators and are 
of structure (6). If now in the numerator of b z we exchange each odd row 
and column with the following even row and column and change first the 
signs of the even columns and then of the even rows, each element of the 
numerator determinant is converted into the conjugate of the corresponding 
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element in the numerator of ai . Consequently 6 2 = i A like manipulation 
of the numerator of 6 t gives fei = - a 2 . In the same way we obtain 6 2i = a zi+ i , 
bzi+i 0,2 ,-. Hence the first two equations of the substitution must have 
the structure (6). The reasoning applies to all its succeeding pairs of equations. 
Thus the property of converting pairs of conjugate points into conjugate pairs 
completely characterizes the second class of substitutions. 

One immediate corollary of this result is that the invariant points of a 
substitution of the second class consist of pairs of conjugate points, for 
clearly if one point of a conjugate pair is invariant, the other must be. The 
geometrical interpretation of this is well known for m = 1 . If, namely, 
complex coordinates are given to the points of a sphere in the proper way, the 
formula (4) represents a rotation of the sphere about the diameter passing 
through the two invariant points of the sphere, and the coordinates* of any 
two diametrically opposite points are s, -- 1/2, or, in homogeneous notation, 
(zi, z 2 ) and ( - 2 2 , 21). 

It has thus appeared that our second class of groups generalizes the charac 
teristic form and property of the familiar elliptic rotation groups for m = 1 . 
For m &gt; 1 it accordingly seems fitting to apply the term elliptic to any sub 
stitutions of our second kind, that is, to the substitutions whose elementary 
divisors occur only in conjugate imaginary or duplicate real pairs. The 
general elliptic group will then consist of all non-singular substitutions of the 
second class with the same pairing of the coordinates, as indicated in equation 
(5). A subclass will comprise the restrictedly elliptic substitutions whose 
characteristic roots are n distinct pairs of conjugate imaginaries. 

* Cf. Klein s Vorlesungen uber das Ikosaeder, pp. 32-34. 

UNIVERSITY OF WISCONSIN, 
MADISON, Wis. 



BIRATIONAL TRANSFORMATIONS SIMPLIFYING SINGULARITIES OF 

ALGEBRAIC CURVES* 

BY 

GILBERT AMES BLISS 

In a preceding paper f I have commented on the proofs of the theorem that 
every irreducible algebraic curve can be transformed by a birational trans 
formation into one having no singularities except double points with distinct 
tangents. There are many proofs, two of which seem to be especially inter 
esting. The first is by Walker, J who has worked out in detail an alteration, 
suggested by Klein in 1S94, of a proof originally devised by Bertini for the 
projective plane. In the second, by Hensel and Landsberg,1f reasoning pro 
posed by Kronecker in 188l|| is extended to apply to the proof of the theorem 
in the function-theoretic plane. These two proofs appeal to me as being the 
best ones which I know for the two cases, but both of them are lengthy and 
complicated when all details are taken into consideration. In the present 
paper I have extended once more the method of Kronecker so that it can be 
applied in both the function-theoretic and the projective planes, and have 
attained what I hope will be regarded as simpler proofs of the two corres 
ponding theorems. 

It should perhaps be said that Kronecker s original reduction of the singu 
larities of an algebraic curve to ordinary double points was effective only for 
the finite part of the plane. The alteration of his method to include the 
points at infinity in the function-theoretic plane is a real extension, as one may 
see by a study of the proof of Hensel and Landsberg. Still more is required 
of the transformation if higher singularities are to be excluded from points on 
the curve in the infinite region as well as in the finite part of the projective 
plane. 

In Section 1 below a set of properties of an algebraic function y(x) defined 
by an irreducible algebraic equation f(x,y) = is described, such that when 

* Presented to the Society, April 28, 1923. 

t The reduction of singularities of plane curves by birational transformation, Bulletin 
of the American Mathematical Society, vol. 29 (1923), pp. 161-183. 

J On the resolution of higher singularities of algebraic curves into ordinary nodes, Dissertation, 
Chicago, 1906. 

See a footnote to the paper by Bertini, Trasformazione di una curva algebrica in un altra 
con soli punii doppi, Mathematische Annalen, vol. 44 (1894), p. 158. 

If Theorie der algebraischen Funktionen, p. 402, 2; see also Hensel, Encyclopadie der malhe- 
matischen Wissenschaften, II C 5, 25. 

|| Journal fur die reineundangewandteMathematik, vol. 91 (1881), p. 301. 
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the function y(x} has these properties the corresponding algebraic curve 
/ = has no singularities in the projective plane except double points with 
distinct tangents. For the simplification of the singularities of an arbitrary 
algebraic curve f (x , y} = the method followed in the succeeding sections 
consists in the construction of a pair of rational functions %(x, y) , 77 (x , y) , 
satisfying an irreducible equation &lt; (, 17) = birationally related to f(x, y) 
= 0, and having the properties described in Section 1. In Sections 2, 3, 4 
the construction of and 77 is described and their properties determined. The 
lemma in Section 2 is particularly effective in simplifying the proofs. In Section 
5 the method of the preceding sections is applied in the function-theoretic plane, 
the general plan being that of Hensel and Landsberg, but the details quite differ 
ent. In Section 6 the distinction between the theorems for the projective and 
the function-theoretic planes is explained in terms of homogeneous coordinates. 
1 . A preliminary lemma. Consider an equation / ( x , y} == in which / is 
an irreducible polynomial in x and y of degree n in y. Near a finite value 
x = a the values of y which satisfy this equation are defined by a number of 
series in increasing integral powers of t of the form 

(1) y = 6P + 6T + , 

where t = (x a) l r , r is a positive integer, and the numbers r , n, fj, , are 
relatively prime. Each such series defines r values of y when the r values of 
(x a) 1/r are substituted, and the sum of the integers r for the various series 
in powers of ( x a ) is equal to n . There are only a finite number of values 
x = a, the branch values, for which some of the corresponding exponents r 
are greater than unity. The polynomial A ( x ) is defined to be the product 
of the factors (x a) r ~ l corresponding to the series (1) which have r &gt; 1 , 

A(.r) == n(-r-a)- 1 . 

Near x = o the values of y are given by expansions (1) in which t = ( 1/x ) 1/r . 
LEMMA 1 . An irreducible algebraic curve f(x, y) = of degree n in y has 
no singularities in the projective xy-plane except double points ivith distinct 
tangents, provided that the algebraic function y (x) defined by the equation has the 
following properties: 

1 . at x oo the n values of y (x} are given by n expansions 

(2) y = 

and y (x) has no other poles; 

2. the discriminant of y ( x ) has the form 

D(x} = V(x] 
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where V(x ) is a polynomial ir/fh roots distinct from each other and from those 
o/A(.r). 

To prove this lemma let / ( x , y} be first expanded in powers of x , y in the 
form 

f(x, y} = (x, 7/) + O, y)i + + (x, y) v , 

where ( x , y \ is homogeneous and of degree k in x and y . From the assump 
tion that the series (2) satisfy / = it follows readily that (1, &*) = 
(k = 1 , , n) , and therefore v = n . It is clear that ( .r , y ) is exactly of 
degree n in y since / itself has by hypothesis this property, and furthermore 
that v n , since otherwise / ( x , y) would contain y n multiplied by a poly 
nomial in x , and the function y(x) would necessarily have poles at finite 
values of x , contrary to the property 1 . The intersections of / = with the 
line at infinity are defined by the equation (.r, ?/) = and are evidently 
distinct. None of them is a multiple point of the curve. 

At a finite value x = a distinct from the roots of Z&gt;(.r) every root b of 
/(a, y) is simple and/ == has no multiple point at (a, b) . 

If the roots of / ( x , y) corresponding to a value x are represented by ?/, 
(i = I , , n) , the discriminant has the value 

(3) P(.r) == F(.r) 2 A(.r) == ]I (y,- - y&gt;) 2 (i,j=-l, -,n). 

For a root x = a of A (a-) the n values of y are defined by expansions (1) in 
positive integral powers of ( x a ) 1/r which have the form 

(4) y = b + V( X -a? r + .--, 

since y ( x ) has no pole at a finite value of x . Each of these series furnishes 
r (r 1 )/2 differences i/; yj of order at least 1/r in (x a ) . Since each 
factor yi yj occurs twice in D ( x ) it follows that for each such series D (x) 
contains at least the factor ( x a ) r ~ 1 , whatever the values of b and b may be. 
Since, by property 2 , D (x) can contain no more powers of ( x o ) than are 
contained in A ( x ) , it follows readily that for each series (4) corresponding to 
x = a the coefficient b must be different from zero, and that the coefficients b 
for the different series corresponding to x = a must be distinct. The value 6 
is therefore exactly an r-tuple root of /(a, y} , and (y by is the lowest 
power of ( y b ) alone actually occurring in the expansion 

(5) /O, 2/) = (x - a, y - b)i + + (x - a, y - &) 

of / ( x , y ) at ( a , b ) . It may also be verified that the expansion (4) can 
satisfy / = only if the term in (x a) is present in (x a, y 6) lt 
Hence none of the points ( a , 6 ) corresponding to a root .r = o of A ( x ) is a 
singular point of the curve / = . 
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At a zero x = a of V (x) the expansions (4) all have r = 1 , since the roots 
of V(x} and A (a:) are distinct by the property 2 of the lemma. Since, by 
the property 2, D (x) has exactly the factor (a: - a) 2 , it follows that one 
only of the differences y f yj has a simple zero at x = a , and that for it the 
expansions of ?/ and yj have the form 

,Q. Vi = b + d (x - a) + , 

yj = b + c 2 (x - a) + , 

with ci 5^ c 2 . The value y = b is a double root of /(a, y) , and (y 6) 2 is 
the lowest power of (y b) alone actually occurring in the expansion (5). 
The series (6) can make (5) vanish identically only if the remaining linear 
term in (a; a) is absent and if (1, ci) 2 == (1, c 2 ) 2 = 0. Hence (a, 6) is a 
double point of the curve / == with distinct tangents. The other roots b of 
/ ( a , y) are in this case simple, and the points ( a , b ) are simple points of the 
curve. 

2. Rational functions &lt;r(x, y) with simple poles. When the series (1) and 
the value of x in terms of t are substituted in a rational function a ( x , y) of 
the variables x , y , an expansion for a is found of the form 

(7) &lt;r(x,y) =&lt;& 



The constant n is a positive or negative integer, not necessarily the same as 
the integer jj. in (1), and is called the order of &lt;r (x , y) at the expansion (1). 
On the Riemann surface T of / ( x , y ) = the places P are in one-to-one 
correspondence with the expansions (1). Hence fj, is also called the order of 
cr(x, y) at P . 

A symbol D == P? 1 P? , where the P s represent places on the Riemann 
surface and the n s are positive or negative integers, is called a divisor. The 
sum /*i + + fjL s = -- q is the order of the divisor. A rational function 
&lt;r(x, y} is a multiple of the divisor D if its orders /j. are g ju t - at the places 
Pi (i = 1, , s) and i elsewhere. One of the most important problems 
of the theory of algebraic functions* is the determination of the functions 
a(x,y} which are multiples of a divisor D . It is a theorem that if q &gt; 2p 2 , 
where p is the genus of the curve f(x, y) = , then the number of linearly 
independent multiples of D is exactly v = q p + l-t 

* Hensel, Encijdopddie der mathematischen Wissenschaften, II C 5, p. 557. 

t Consider two divisors Q, Q with orders q, q , and whose product is in the differential 
class ( W ) . Let { Q } , {Q \ represent the numbers of linearly independent multiples of 1 /Q and 
1/Q , respectively. Then, by the Riemann-Roch theorem, 

q + q = 2p-2, (Q ]-{Q}-q + p-l. 

See Hensel und Landsberg, Theorie der algebraischen Funktionen, pp. 301, 304, formulas (la) and 
(II) ; and Hensel, loc. cit., pp. 557-8. If q &gt; 2p 2 then g =2p 2 - q&lt;Q, and the number 
{Q } of multiples of 1/Q is zero. Hence in this case {Q} = q p + 1 . In the text above 
the 1/Q of this footnote is replaced by D . 
Trans. Am. Math. Soo. 19. 
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Consider now a divisor D == Pf 1 P" 1 for which q &gt; 2p + 2. The 
v = q p + 1 linearly independent multiples &lt;TI , , &lt;r v of this divisor are 
everywhere finite except possibly for simple poles at the places in D . The 
coefficient c of the lowest power permitted in the expansion (7) for a multiple 
of D may be called the leading coefficient of &lt;r at the corresponding place P , 
and for the functions a\ , , &lt;j, these coefficients will be denoted by c\ , ,&lt;?. 

LEMMA 2. For the multiples a\, , &lt;r v of a divisor D = Pr 1 P^ 1 
(q &gt; ?p + 2) the matrix 

c v 

(8) 



of leading coefficients at four arbitrarily selected places P , P , P" , P " on the 
Riemann surface off(x, y} = has always rank 4. 

Suppose that this were not true for a set of places P , P , P" , P " . Then 
there would be at least v 3 linearly independent functions of the form 
Vi 0i + + i\ a* with Vi , , v v constants and with leading coefficients 
zero at P, P , P", P" . These functions would be multiples of the divisor 
Di == PP P"P "D of order 4 -- q. But this is impossible, since q - 4 
&gt; 2p 2 and the number of linearly independent multiples of DI is therefore 

q -- 4 p + 1 = v 4. 

COROLLARY 1. The matrix corresponding to (8) for k &lt; 4 places has always 
the rank k . 

If the place P is replaced by P , and the second row in the matrix (8) by 
the second coefficients di , , d v of the expansion (7) for &lt;j\ , , a v , then 
the lemma and its proof would still be valid. The following corollary is there 
fore also true: 

COROLLARY 2. At every place P the second coefficients di, , d v in the 
expansions of &lt;r\ , , cr v are not all zero. 

3. Properties of pairs of rational functions. The properties described in 
this section for pairs of rational functions ( x , y ) , ?? ( x , y) on the Riemann 
surface T of an irreducible algebraic equation / ( x , y) = are well known. * 
They are reproduced here, however, in a form which is especially adapted 
to the applications which are to be made in Section 4. 

In the first place if !-(x, y) has q poles on the surface T, then for every 
constant a the function ( x , y) a will have q zeros, and it is clear that at 
the q places of T where ( x , y} has a fixed value the function 77 ( x , y ) takes q 
values 771, , f] q which may or may not be distinct. To obtain analytic ex- 

* Hensel und Landsberg, loc. cit., p. 247. Appel and Goursat, Theorie des Fonctions 
algebriques, p. 256. 
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pressions for r?i, , 77, for values of near = a, consider the expansions 
analogous to (7) for the functions ( x , y ) a and ri(x,y) near a zero of the 
former. These have the form 



(9) - a = ct p + dt" +1 + -, v --= at" + di t +1 + ( C ^ 0, ci ^ 0) . 

The first equation can be solved for t as a series in T --= ( a) 1/p defining p 
distinct values of t for each near a. when the p values of T are substituted. 
When this series is substituted in the second equation the result has the form 

(10) 7? = /3T* + P T" + . 

The (/_ values rji, , r] q of 77 (x, y) corresponding to a value of near = a 
are defined by a number of equations (10), one for each of the zeros of (x, ?/) 
- a. . The sum of the orders p of these is q . For values near = &lt;x&gt; the 
only difference is that the parameter in (10) is T --= (l/) 1/p . 

The symmetric functions of the values 771, , r] q corresponding to a given 
are rational functions of . For they are clearly representable by series in 
fractional powers of ( a) or l/ near = a or = &lt; , and these series 
can have only integral exponents since the symmetric functions which they 
represent are single-valued in . Hence the symmetric functions, being single- 
valued in and having no singularities except poles, are rational in . It 
follows now that the product ( t] 771) (77 772) (T? 77,) is a polynomial 
of degree q in 77 with coefficients rational in , and when cleared of fractions it 
becomes a polynomial (j&gt; ( , 77 ) in both and 77 . The equation &lt; ( , 77 ) = 
of degree q in 77 is satisfied by all of the values of ( x , y ) , 77 ( x , y } at places 
on T. 

The polynomial &lt;f ( , 77 ) is either irreducible or else the power of a single 
irreducible factor. In fact, each irreducible factor of &lt;/&gt;(, 77 ), being made zero 
by one at least of the expansions (10), must also be caused to vanish identically 
by (9) and all of its continuations on the surface T . But in that case the 
irreducible factor must be satisfied by all of the roots 771, , 77,, and the 
factors of 4&gt; (, 77) must all be identical. The important case when &lt;j&gt; (, 77) 
itself is irreducible will evidently occur if and only if there is at least one value 
for which all of the values 771., , 77 3 are distinct. 

When ( , 1 ) is itself irreducible the equation &lt;f&gt; ( , 77 ) = is birationally 
related to f (x , y} =0. For then the set of series (10) corresponding to 
= a. must define distinct values 771, , 77, for values of near = a, and 
the series themselves must all be distinct. To each one of them corresponds 
therefore a unique place on the surface T , which is equivalent to saying that 
there corresponds to each place on the Riemann surface E7 of &lt; ( &gt; *7 ) =0a 
unique place P on the surface T . The values x , y belonging to P are therefore 
single-valued functions on the surface U . They have at most poles, since as 
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functions of t they have only poles, and since t is expressible in positive integral 
powers of T by solving the equation 

T = t(c + dt+ ...) 1/p 

obtained from the first equation (9) and the equation a = T" . Under 
these circumstances x and y are necessarily rational functions of , 77 , and the 
curves f(x, y) = and &lt;/&gt;(&gt; 7 ?) = are birationally related. 

4. Transformation to a curve with ordinary double points only in the pro- 
jective plane. In this section it is proposed to prove the following principal 
theorem of this paper : 

An irreducible algebraic curve f (x , y} = can always be transformed by a 
birational transformation into a second such curve &lt;f&gt; (, 77) = having no singu 
larities in the projectile ^-q-plane except ordinary double points. 

The method of proof is to construct, with the help of the functions a\ , ,&lt;r v 
of Section 2, a pair of rational functions ( a: , y} , 77 ( .r , y) satisfying an 
algebraic equation &lt;/&gt;(, ~n ) = birationally related to/ (x, y) = and having 
the properties 1 and 2 of Lemma 1 . 

Consider now a divisor D == Pf 1 P^ 1 with q &gt; 2p + 2 , as in Section 2, 
having the linearly independent multiples a\ , , a v . A particular function 

( x , y ) = MI o-i + + u,&lt;r v 

can be chosen with the constants u\ , , u v fixed once for all so that at each 
of the places PI, , P g the order of is exactly -- 1 (Lemma 2, Corollary 1 
for k = 1 ) . For a second function of the family 

(11) T)(X, y) = Viffi + + 1\&lt;T V 

the expansions (10) corresponding to = o arise at the places PI , , P q , 
and are readily seen to have the form 



I 5 \ r-n. i i~ K . 

The constants ft will be distinct provided that Vi , , v v do not satisfy a 
certain system L of linear equations (Lemma 2, Corollary 1 for k = 2 ) . 
Under these circumstances the values 771, , 77, corresponding to a value of 
near = &lt;*&gt; will be distinct, and and 77 will satisfy an irreducible algebraic 
equation &lt;/&gt;(, 77) =0 of degree q in 77 birationally related to / ( x , y ) = . 
Furthermore; since 77 is infinite only at the places PI, , P g where is also 
infinite, the only poles of the function 77 ( ) defined by 4&gt; = are those given 
by the q expansions (12), and these have now the properties prescribed in 1 
of Lemma 1. 

The discriminant of 77 with respect to , 

(13) D(v lt -,*) = II(^-^) 2 (i,j= 1, -,9), 
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is a polynomial in vi , ,,. For it is symmetric in 771 , , 77 a and there 
fore rational in , and it must be a polynomial in since it is finite near every 
finite value of . It may be represented as a product W ( Vi , ,,) A () 
where A ( ) contains all factors of D containing only , and where W has con 
sequently no factor in alone. 

Near a finite value = a. the q values of 77 are defined by a number of series 
(10) which now have the form 

(14) 77 =/3+/3 (-c0 1/p + 

since 77 has no poles except at the places PI, , P q where is also infinite. 
The p values of 77 defined by one of these equations provide p (p 1 )/2 factors 
17 i 77 y in D , each repeated twice, and each of order at least 1/p in ( a) . 
Hence they contribute at least the factor ( a)"" 1 to A () . If the coeffi 
cients Vi , , v, in (11) do not satisfy a certain system of linear equations, then 
the coefficients /3 in the expansions (14) corresponding to = a. will all be 
different from zero (Lemma 2, Corollary 2), and the constants /3 will all be 
distinct (Lemma 2, Corollary 1 for k = 2 ) . For a suitable special choice of 
the coefficients v, therefore, the discriminant D will contain no more powers 
of ( a) than the product of the factors ( a)"" 1 corresponding to 
the different expansions (14) for = a , and it can consequently contain no 
more when the v s are indeterminates . It follows readily that A ( ) is exactly 
the product !!(-- a)^ 1 taken for all the expansions at branch values 
= a, i.e., the values for which the function (a;, y) a has multiple roots. 
The factor W ( vi , , , ) of the discriminant is irreducible or else a 
product of a number of irreducible factors which are also polynomials in 
*i &gt; t i&gt;v&gt; Near a value = o which is not a branch value of the 
discriminant D , thought of as a polynomial in Vi , , v v , is completely decom 
posable into a product of factors 

(15) T?; f]j = fli (ffn - (Tiy) ++ (0V - (r,y) 

linear in vi , , and with coefficients power series in o with constant 
terms not all zero (Lemma 2, Corollary 1 for k = 2 ) . Each irreducible 
polynomial factor of D must therefore be expressible near = o as the product 
of a number of distinct factors of the form (15), and each must occur at least 
twice in W since each linear factor (15) occurs twice in D . An irreducible 
factor of W could not occur more than twice, since otherwise there would have 
to be at least two linear factors 77; 77 y , r] k -qi having (i, j} ^ (k , I) and 
with proportional coefficients. This would imply a relation 

77; - T?; = (r] k - rn )g 
holding identically in Vi, , v v , where g is a series in ( o) with constant 
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term different from zero. But this would contradict the validity of Lemma 2 
at = o , or its Corollary 1 for k = 3 when only three of the integers i,j,k,l 
are distinct. It is clear then that each irreducible factor in W occurs exactly 
twice, and D has the form 

D(VI, ,,-,) = V(vi, , ) 2 A (), 

where V is a product of distinct irreducible polynomials in ?&gt;i, ,? , con 
taining no factor in alone. 

Suppose now that the coefficients v\, -, have numerical values not 
satisfying the linear system of equations L . Then, as has been shown above, 
the functions (.r, y) , rj(x, ?/) satisfy an irreducible algebraic equation 
( , 17 ) = birationally related to / ( x , y ) == , and the only poles of the 
function 77 ( ) defined by &lt;j&gt; ( , rj ) =0 are given by the expansions (12) which 
have the form prescribed in 1 of Lemma 1 . If furthermore vi, , v p are so 
chosen that the roots of the irreducible factors of V ( Vi , , v v , ) are distinct 
from each other and from those of A ( ) , then D has the form prescribed in 2 
of Lemma 1. Hence 0(, 17) =0 has in the projective plane only double 
points with distinct tangents. 

5. Transformation to a curve with ordinary double points only in the func 
tion-theoretic plane. The projective xy-plane is the extension of the euclidean 
?/-plane which is obtained by introducing the homogeneous coordinates 
x = Xi/Xs , y = a 2 /.T 3 . The function-theoretic plane, on the other hand, is 
obtained when the homogeneous coordinates x = .TI/.TO, y = yi/y% are used. 
In the former case the infinite region is the line .r s = . In the latter case it 
is the totality of points whose non-homogeneous coordinates have the form 
( &gt; , y ) or ( x , OD ) , or whose homogeneous coordinates have a- 2 = or y* = . 

An irreducible algebraic curve f(x,y) = is said to have ordinary double 
points only in the function-theoretic .ry-plane if its singularities in the finite 
part of the plane are ordinary double points, and if its points at infinity provide 
only simple points or double points with distinct tangents when they are 
transformed into the finite part of the plane by means of one or both of the 
transformations x = I/a: or y = 1/y . It is possible to transform an irre 
ducible algebraic curve /(a:, y) = into one with only ordinary double points 
in the function-theoretic plane by a birational transformation of the form 

= x, 77 = r](x, y}, 

as will be shown in the following paragraphs. The first step in the proof of 
this statement is the following lemma analogous to Lemma 1 of Section 1. 

LEMMA 3. An irreducible algebraic curve f(x,y)=0of degree n in y has 
no singularities except double points with distinct tangents in the function-theoretic 
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plane, provided that the function y(x) defined by the equation / = has the 
following properties : 

1. at x = oo the function y(x] has n distinct finite values; 

2. the poles of y ( x ) are simple and at values x = a distinct from each other and 
from the branch values of x; 

3. the discriminant of y (x) has the property 3 of Lemma 1. 

It is clear from 1 that the points ( &lt; , y) on the curve are simple when 
transformed into finite points by the transformation x =-- I/a; , and from 2 the 
same is true of the points (x, &lt;x&gt; ) when transformed by y = l/y . 

The finite roots of / ( x , y) for a value x = x over which y ( x ) has a pole 
are all simple, by 2, and hence they provide only simple points of the curve 

f(x,y) = 0. 

The discriminant of y ( x ) is by definition 

D(x) = 



where A is the product of the linear factors x a o belonging to values a o for 
which y (x) has poles. By a repetition of the argument of Section 1 it is 
provable that f(x, y} = has only double points with distinct tangents 
corresponding to values x = a other than those discussed above, and the 
lemma is therefore proved. 

To construct a rational function 77 ( x , y ) whose irreducible algebraic equation 
&lt;t)(x, ?;) = is birationally related to f(x, y} = and has the properties 
1 , 2 , 3 of Lemma 3, one may start from a divisor D = Pf 1 P^ 1 with 
q &gt; 2p + 2 , as in Section 2, whose places Pi on T correspond to finite values 
of x distinct from each other and from the branch values. If the coefficients 
Vi, , v v in the expression 

ri(x,y) = Viai + + v, a, 

do not satisfy a certain system LI of linear equations, the function t](x, y) 
will have distinct finite values at x = (Lemma 2, Corollary 1 for A; = 2) and 
a pole of order one at each place Pi (Lemma 2, Corollary 1 for & = 1 ) . Since 
under these circumstances the n values of 77 corresponding to a value x near 
x = oo are all distinct, it follows that 17 ( x , y ) will satisfy an algebraic equation 
(j)(x, 77 ) = birationally related to / ( x , y) = and having the properties 1 
and 2 of Lemma 3. 
The discriminant of 77 ( x , y} with respect to x is by definition 
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where A is the product of the factors (x a - ) belonging to the places P of D , 
and where r?i, , t] n are the n values of r\ (x , y) for a given value x. This 
expression for D is symmetric in 771, , t] n and therefore rational in a:. It 
can become infinite only at the values XQ corresponding to the places Pi . At 
such a value a- only one of the values r? 4 - has a simple pole. There are therefore 
only n -- 1 of the factors 771 t]j which have simple poles, and in D these are 
annulled by the factor A 2n ~ 2 . Hence D is finite for all finite values of x and 
is a polynomial in v\, , v v , x. It is expressible in the form 

W(v it ,v f , x)A(x) 

where W contains no factor in x alone. 

It may now be proved, exactly as in Section 4, that A ( x ) is the product 
II ( x a} r ~ l formed for all the branch places on T , and that W is the square of 
a product V ( TI , ,, x ) of distinct irreducible factors. Hence if v\ , , 
v v are chosen not satisfying the system LI of linear equations, and so that the 
roots of F are distinct from each other and those of A (x) , then &lt;j&gt; (x, tj ) = 
will have the properties 1,2, and 3 of Lemma 3, and the curve which it defines 
in the function-theoretic plane will have only double points with distinct 
tangents. 

6. Homogeneous coordinates. The branches of an irreducible algebraic 
curve / ( x , y ) = have for x = oo and x a the forms 

x = t~ r , y = bt" + b &lt;" +-, 

x = a + t r , y = bt + b i + . 

These, with their transforms after replacing t by a series ct + dt 2 + ( c T* ) , 
are included in the type 

(16) x = P(t], y = Q(t), 

where P and Q are series in integral powers of t having at most a finite number 
of terms with negative exponents. 

In terms of homogeneous coordinates x = Xi/x 3 , y = x 2 /x s the branch (16) 
takes the form 

(17) xi = ai + ai t+ (Z=l,2,3) 

where (01,02,03) ^ (0,0,0). A branch of this character is said to be linear 
if the matrix 

i az 

on d i 0:3 

is of rank 2 . Two such branches with the same center a\ : a 2 : a s have distinct 
tangents if the determinant 

i a 2 a 3 
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is different from zero, where a( , a 2 , a 3 are the coefficients of t in the second 
branch. 

In the project! ve plane a curve f(x, y} = has no singularities except 
double points with distinct tangents if all of its branches are linear, if the same 
center is never shared by more than two branches, and if every pair of branches 
with a common center has distinct tangents. It is readily provable that the 
branches (17) of a curve/ ( a- , y ) = such as is described in Lemma 1 of Section 
1 have these properties, and that the properties themselves are invariant under 
transformations of the parameter t and projective transformations 



Xi = an 1 + GHZ 2 + i3 3 



i= 1,2,3). 



In terms of homogeneous coordinates x = Xi/Xz, y = 2/1/2/2 the branches 
(16) take the form 

7-. /I. _1_ /V. / -1- . . . 
**% - U/j ^ UC-i fc I , 

Such a branch is said to be linear if the determinants 

1 on bi 0i 

2 a 2 "2 02 

are not both zero. Two branches with the same center 

/ . IT . L \ { n \h r h } 

have distinct tangents if the expression 



Oil 



b{ 0i 
b 2 02 



lai 



Oil 



61 



is different from zero, where the primes designate the coefficients of the second 
branch. 

A curve with no singularities except double points with distinct tangents in 
the function-theoretic plane is now denned as in the next to last paragraph 
above. The curves described in Lemma 3 of Section 5 have these properties, 
and the properties themselves are invariant under transformations of the 
parameter t and the transformations 



= an 1 + oiiz 
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(t=l,2). 
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A SYMBOLIC THEORY OF FORMAL MODULAR COVARIANTS * 

BY 

OLIVE C. HAZLETT 

PART I. INTRODUCTION 

1. Prologue. Thus far, very little has been published on the general theory 
of formal modular invariants or covariants. Workers have, on the whole, 
obtained results for special, more or less isolated, cases; and although some 
beautiful and important general theorems have been proved, they are more or 
less unrelated. This is, of course, only natural in any division of knowledge 
in its formative state. 

Nevertheless, no worker in the field could fail to be conscious of a certain 
uniformity common to the special cases that have been studied in detail; 
though (alas!) this uniformity usually appeared to be broken ruthlessly in the 
next case studied. This breaking of an apparent law signified, however, 
merely that we did not know these special cases with a sufficient thoroughness 
of illuminating detail, or were trying unwittingly to make the laws conform to 
certain standards, unconsciously preconceived. This latter handicap was laid 
on us naturally enough by our thorough knowledge of algebraic invariants and 
the fact that this newer kind of covariants is, in many ways, strikingly like 
the older, classic covariants, though so tantalisingly different. 

Their similarity and their difference show themselves in the very beginning 
of the study: in the definitions, in the simplest examples. Perhaps the dif 
ferences that first come to mind are those which are inherent in the fields of 
definition, which, in the case of classic covariants, is the field of reals or ordinary 
complex numbers and, in the case of modular covariants, is a Galois field, 
GF[p n ] , of order p n . These differences are too obvious to mention in detail, 
but one who has studied the beautiful proofs given by the old masters of 
invariant theory has been forced to the conclusion that most of the proofs 
seemed to use the properties of a field of characteristic zero, not in some 
accidental manner, but rather in veriest necessity. 

Growing from the surface differences between the two fields are two very 
important distinguishing characteristics of the two kinds of covariants. It 

* Part II was presented to the Society, September 7, 1920; Part III, December 28, 1921; 
Parts IV and V, December 27, 1922. 

The reading of the literature in connection with this paper was much facilitated by the 
purchase of books with a grant made by the American Association for the Advancement of 
Science and this help is herewith gratefully acknowledged. 
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is well known that an algebraic covariant is necessarily such that all of its 
terms have the same weight i.e., it is isobaric; but a modular covariant is not 
necessarily isobaric. Nevertheless, although its terms are not in general of 
exactly the same weight, their weights can differ at most by multiples of 
p n - 1 . Moreover, if two classic invariants are identical in value for all 
marks of the field, they are necessarily identical in form, and conversely; 
whereas, if two formal modular invariants are identical for all marks of the 
field of definition, they are not necessarily identical in form. Nevertheless, 
although two such invariants can be of different degree and appear quite 
different in form at a casual glance, yet they necessarily have in common 
certain fundamental characteristics.* There are other differences that will 
occur to any worker in the field, but I think that I have mentioned the most 
refractory. 

As indicated above, in spite of the great differences between the algebraic 
covariants and formal modular covariants, there are certain fundamental 
likenesses which are more easily sensed than they are analyzed. A feeling 
that there is some theory which underlies all the special cases, and yet which is 
comparatively simple, made the writer try to crystallize this theory into words. 

In the spring of 1918, came the feeling that the theory of formal modular 
covariants of a binary form, /, for the field GF[p n ], must be, in essentials, 
equivalent to the theory of simultaneous algebraic covariants of / and certain 
other forms obtained from/ by replacing the coefficients of/ by their respective 
p n th powers. This is natural enough, since these powers of the coefficients of 
/ are cogredient with the coefficients of / for the transformations of the group 
used, and this is the only way in which a formal modular covariant differs 
from an algebraic covariant of /. Then there appeared other indications 
that there is an intimate relation between formal modular covariants of / and 
algebraic covariants of a system of forms consisting of / and related forms. 
This paper is an attempt to put in systematic form the theorems which emerged 
when these eventually crystallized. 

2. Summary of literature. Let 

/(a; x) = a x m + a t x m ~ l x z + + a m - l Xi x m ~ l + a m x m 
be a binary form and let G be a group of linear transformations 

(1) x i = ^x 1 + r, i x a (=1,2) 

whose coefficients, the s and rj s, are in the field F . If / ( a ) be a polynomial 
in the a s which, under all transformations of the group, is transformed into 
I ( a ) such that 

(2) I(a }=D w I(a) (in the field) 

* Some of these characteristics are indicated in a paper by Hazlett, these Transac 
tions, vol. 22 (1921), pp. 144-157, especially p. 145. 
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where 



then I is said to be an invariant of /under the group G . Similarly, if C (a; re) 
is a polynomial in the a s and the x s which, under all transformations of the 
group G, is transformed into C (a ; x ) such that 

(3) C(a -x ) ---- D u C(a;x) (in the field), 

then C is said to be a co variant of / under the group G . Also, we speak of 
invariants and covariants of a system S of binary forms /,- . 

When the field of definition, F, is the field of all reals or the field of all 
ordinary complex numbers, the covariants of S are the ordinary covariants of 
&lt;S of the classic invariant theory, with which are associated the names of 
Cayley, Sylvester, Hermite, Aronhold, Gordan, Clebsch and Hilbert. 

When the field of definition, F , is the Galois field, GF[p n ], of order;;", the 
covariants of *S are called modular covariants. Here the s and TJ S are marks 
of the Galois field, GF [ p n ], of order p n * defined by the prime p and an 
algebraic equation, P(A ) == 0, of degree n; and thus (2) and (3) are con 
gruences, reduced modulis p and P(A ). This means that, in (2) and (3), 
the left member is identical with the right member if we replace the p"th power 
of , and of rn by ,- and ??,- respectively, in view of Galois generalization of 
Fermat s theorem. f For a modular field there are two different types of 
covariants. If the coefficients of the forms (the a s) range over the marks of 
GF [p n ] , and if (2) and (3) are congruences which are true if Fermat s theorem 
is applied, not only to the s and rj s, but also to the a s, then the covariants are 
called simply modular covariants. If, on the other hand, the a s are independent 
variables, Fermat s theorem does not apply to them; and hence, in (2) and (3), 
the left member is understood to be identical with the right member without 
any reduction in the exponents of the a s. Such covariants are called formal 

* Let p be any prime and let P ( X ) = be any algebraic equation of degree n which has 
its coefficients integers reduced modulo p and which is irreducible, modulo p . Then, if we 
reduce any polynomial in X modulis p and P ( X ) , we obtain a polynomial of the form 

M(X) = Cn-i X" 1 + c n _ 2 X"- 2 + +c,A + c 

where each of the c s is an integer of the set , 1 , , p 1 . The totality of all polynomials 
congruent modulis p and P (X) to the same M (X) is said to form a class of residues. Since 
there is a class of residues for each set of values for the c s, there are p n such classes. The 
totality of the classes of residues are closed under addition, subtraction, multiplication and 
division (provided the divisor is not zero) and so constitute a field containing p n marks or 
elements. Any two such fields are the same for a given p and a given n . This is called a 
Galois field of order p n and is denoted by GF[p n ] . Moreover, any finite field is simply iso- 
morphic with a Galois field. 

t If a is any mark of GF[p n ], defined by p andP(X), then a P" = a (moddp, P(X)). 
See any standard work on Galois fields, such as Dickson, Linear Groups, Teubner, 1901, p. 11; 
Serret, Cours d Algebre Superieure, vol. 2, p. 180. 
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modular covariants. If the covariant is independent of the variables, it is 
called a modular invariant or a formal modular invariant according as the o s 
are marks of the field or independent variables . 

Although Hurwitz introduced the notion of formal modular invariants in 
connection with the determination of the number of solutions of higher con 
gruences,* Dickson discovered them independently four years later from a 
different point of view.f Practically all important results in the theory are 
due to Dickson. J 

Clearly, every formal modular covariant of a system S is a modular covariant 
of S, though not every modular covariant of S is also a formal modular co 
variant of S . For a modular invariant, I , of S is concerned solely with the 
values of 7 for the different sets of values of the o s in the field of definition, 
whereas a formal modular invariant of S is concerned not only with the values 
but also with the form of / . This statement has to be modified somewhat for 
covariants that are not invariants, since, in (2) and (3), the left member is the 
same as the right member in a purely formal sense as far as the x s are con 
cerned, for both kinds of covariants. For example, if a is any one of the 
coefficients of/, then a pn - a always has the value zero when a is a mark of 
GF[p n ] and hence it is a modular invariant of/; but it is not a formal modular 
invariant, since it is changed in form if we interchange xi and x 2 . To take 
a less trivial example, 

q =-- (a + c) (b 2 + ac - 1) 

is a modular invariant of / = ax\ + 2bxi 2 + cx\ , mod 3 , but is not a formal 
invariant, since a + c goes into - a b + c under the transformation 
%i = x( + x 2&gt; x 2 = x 2 . 

In 1909, Dickson studied modular invariants from a different point of 
view and introduced the notion of classes of forms. This enables one to see 
to the very heart of the theory and the finiteness theorem follows almost 
directly from the definition. Later, || he proved the finiteness theorem for 
modular covariants. 

* Ueber hohere Kongruenzen, Archiv der Mathematik und Physik, ser. 
3, vol. 5 (1903), pp. 17-27. 

f Invariants of binary forms under modular transformations, these Transactions, 
vol. 8 (1907), pp. 205-232. 

J Anyone wishing to gain familiarity with this beautiful theory should read the article by 
Hurwitz mentioned above and Dickson s papers, of which the most fundamental are I. General 
theory of modular invariants, these Transactions, vol. 10 (1909), pp. 123-158; II. Proof 
of the finiteness of modular covariants, ibid., vol. 14 (1913), pp. 299-310. Also one should read 
the brief but important paper by Miss Sanderson, Formal modular invariants with application 
to binary modular covariants, these Transactions, vol. 14 (1913), pp. 489-500. All 
results obtained up to 1914 are summarized in Dickson s Madison Colloquium Lectures, and 
all essential results published up to August, 1922, are summarized briefly in Chapter 19 of his 
History of the Theory of Numbers, vol. 3. 

Dickson, I. 

|| Dickson, II. 
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There is, however, an intimate relation between modular invariants and 
formal modular invariants. For Miss Sanderson s theorem* tells us that, 
corresponding to any modular invariant, i, of a system S under any group G 
of linear transformations with coefficients in GF [p n ] , there is a formal invari 
ant, / , under G such that 7 = i for all sets of values, in the field, of the coeffi 
cients of the system S . This enabled her to construct modular co variants of 
a system S of binary forms, /,-, from the modular invariants of another system 
S , consisting of the forms / and an additional linear form with coefficients .r 2 
and -- xi . Moreover, every modular co variant of S is a polynomial in the 
universal covariant, L = x^" x z xi x$ n , and the modular covariants ob 
tained in the manner indicated by Miss Sanderson s theorem. f Since every 
algebraic covariant of S is obtained from the algebraic invariants of S with 
out the need of any additional covariant, analogous to L, this result shows 
one of the fundamental differences between algebraic and modular covariants. 

In connection with this last remark, several minor results are of interest, 
as they point the way toward more general results. In 1920, it was shown that, 
for the field GF [ p n ] with p ^ 2 , every modular covariant of a binary form, 
/, whose degree is not divisible by p, is expressible as a rational function of 
the universal modular covariants, L and Q, and of algebraic covariants of/. 
For formal modular covariants of /, there is a theorem which is more com 
plicated in statement but similar in essence. J Then, in 1922, W. L. G. 
Williams announced that every formal modular seminvariant of / (aside from 
a power of a ) is a polynomial in the algebraic protomorphs and in 

(3 = a? - d ar&gt; 

for GF[p] when the form /is such that binomial coefficients can be used. He 
also proved the analogous theorem for formal seminvariants of two or more 
binary forms. In this latter, it is interesting to note that, to a fundamental 
set of algebraic protomorphs, it is necessary to adjoin only one new semin 
variant, namely, one of the same type as /3 but formed for any one of the forms 
/j . It will be seen that these results indicate relations between modular 
covariants and algebraic covariants, in spite of their superficial differences. 

3. Summary of this paper. We first explain a symbolic notation for formal 
modular covariants which is like that generally used for algebraic covariants 
in most respects, but is (necessarily) different from the latter in one essential. 

* These Transactions, vol. 14 (1913), pp. 489-500. 

t Hazlett, A theorem on modular covariants, these Transactions, vol. 21 (1920), 
pp. 247-254. 

t Hazlett, Associated forms in the general theory of modular covariants, American 
Journal of Mathematics, vol. 43 (1921), pp. 194-196. 

Fundamental systems of protomorphic formal modular seminvariants of binary forms, read 
before the Society (Rochester, N. Y.), Sept. 8, 1922. I saw this article in MS. after announcing 
to the Society (Toronto), on December 28, 1921, the results of Part III of this paper. 
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This notation, however, has the advantage that it can be used for both algebraic 
co variants and formal modular co variants; and thus, by its use, the proof of 
every theorem in this paper is such that it applies both to the modular and the 
non-modular cases. To help the reader appreciate the diversity of these two 
interpretations of this theory, the more important theorems are applied to a 
number of special cases. 

As in the symbolic theory of algebraic covariants, it is readily seen that all 
formal modular covariants are polynomials in a finite number of symbolic 
expressions having the invariant property. Then are proved several other 
elementary theorems which assume a simple and familiar form for algebraic 
covariants, but which, for formal modular covariants, assume a form rather 
strikingly different. 

It is then shown that every isobaric formal modular covariant, C , of the 
system S is congruent, in the field, to a function of the coefficients of S which 
is (in a certain general sense) an algebraic covariant of S . When, however, 
C is not isobaric. it is not congruent to an algebraic covariant of S; but it is, 
nevertheless, congruent to an algebraic covariant of an enlarged system, S , 
consisting of the forms of S and other forms obtained from the forms of S by 
replacing the coefficients of S by their p"th powers. By various devices, there 
are expressed in symbolic form the formal modular invariants of a fundamental 
set for the cubic, modulo 2, and for the quadratic, modulo 3; and the above 
theorems are verified. 

Finally, by using the same theorems, we prove that the set of all formal 
modular covariants of any binary system, S , with respect to the Galois field, 
GF[p n ], is such that (1) all syzygies among them are consequences of a finite 
number of such syzygies; and (2) all formal modular covariants are expressible 
as polynomials in a finite number of such covariants. 

PART II. SYMBOLIC NOTATION 

4. Explanation of the notation. As in the theory of algebraic covariants, 
formal modular covariants of a binary form, /, assume a form which is both 
simple and elegant when the form / is expressed as a product of symbolic linear 
factors. In the theory of algebraic covariants, it is customary to express a 
binary form / of degree TO as a symbolic TOth power ; but, for reasons which 
were explained elsewhere in detail,* it is not possible to express the general 
form/ of degree m as a symbolic mth power and then express all formal modular 
covariants by means of these symbols for the general Galois field. 

For this reason, represent 

/ = o a? + i xr 1 xi+~ - 

* New proofs of certain finiteness theorems in the theory of modular covariants, these Trans 
actions, vol. 22 (1921), pp. 152-153. 
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as 

/ = ira x = TT ( on a-i + 2 a- 2 ) = ( i Zi + a 2 z 2 ) ( ft Xi + & .r 2 ) 



where it is understood that there are m symbolic factors which are symbolically 
distinct, and consider its formal modular co variants under the group of trans 
formations (1) where the s and rj s are any marks of the Galois field, GF[p n ] , 
of order p n such that the determinant is not zero in the field. Then, as has 
been proved elsewhere,* every rational integral covariant (both classic and 
modular) of / is a polynomial in the symbols a, /3, and in the .T S. The 
converse is also true; though, in order that a polynomial in the .T S and in the 
symbols be rational in the a s, ( i ) it must be symmetric in the m pairs ( ai , 2 ) , 
(/3 i; j3 2 ) , and such that each term contains as many a s as /3 s, as many 
j8 s as 7 s, etc.; or (ii) it must be a sum of a number of expressions described 
under (i). Similar remarks apply to co variants of several binary forms. 
For convenience, any polynomial in the x s and the symbols which has the 
invariantive property under a transformation of the group will be called a 
symbolic covariant, even if it do not satisfy the conditions that it be rational 
in the a s. 

5. Fundamental set of symbolic covariants. Early in the symbolic theory 
of algebraic covariants, it is proved that all algebraic invariants are expressible 
as polynomials in a finite number of symbolic invariants of the type 

( a/3 ) = ai 182 a 2 ,81 . 

Similarly, all algebraic covariants are expressible as polynomials in a finite 
number of symbolic covariants of the type ( a/? ) and a x = on xi + 2 -?2 
For formal modular covariants we have a similar theorem, though we have 
to use new types of symbolic covariants. 

For simplicity, we shall first consider covariants of a single form, / = ira x . 
Now it is known that the pair ( on , a 2 ) is pseudo-cogredient with ( a % 2 , Xi ) . f 
Accordingly, any formal modular covariant may be regarded, for any purpose 
not involving the notions of weight or index, as if it were actually an invariant 
of TO + 1 cogredient pairs, consisting of the pairs of symbols and the pair 
(xz , a, i ) But it is known that the formal modular invariants of a number 
of cogredient points have the finiteness property.^ Notice that this argument 
applies equally well to a system of binary forms, and thus we have 

THEOREM I. All rational integral formal modular covariants of a system of 
binary forms, fi = ir( ai xi + a 2 x 2 ) = a* j3 x , with respect to the Galois 

* Preceding reference, top of p. 153. 

f That is, the two pairs are cogredient aside from a multiplicative factor which is a power 
of the determinant of the transformation. 

J F. B. Wiley, Proof of the finiteness of the modular covariants of a system of binary forms and 
cogredient points, these Transactions, vol. 15 (1914), pp. 431-438. 
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field GF[p n ], of order p n , are polynomials in a finite number of symbolic co- 
variants which are polynomials in the x s and in the as, jS s, . 

6. Illustration of Theorem I. Dickson* has shown that all formal modular 
invariants of the binary quadratic / = a x\ + 2a x x l x 2 + a 2 x\ , modulo 3 , 
are polynomials in 

A = a? a 02 , J = o 70 = o ( o + i + c 2 ) ( a + 2aj + a 2 ) a 2 , 

B = 13 ji = ai(a\ al ) ( a 2 a ) ( a\ a\ ) , 

T = ( a + a 2 ) ( 2a + 2ai + a 2 ) ( 2a + ai + a 2 ) . 

If we express /in symbolic form (4) as/ = (^.TI + ct 2 .r 2 ) (PiXi + j3 2 a: 2 ), 
then = i|8i, i = -- (aij8 2 + a 2 |8i) mod 3, o 2 = a z /3 2 . Hence it is 
readily seen that 

A = (a/3) 2 , J == (a 3 a)(/3 3 /3), 
5 = Z(a 3 /?) 2 (/3 3 /?), 

T= - Ni (mod 3), 

where 

Ni == [(a 3 /?) 3 (0 3 /3) -- (0 3 a) 2 (a 2 a 



is one of a fundamental set of formal invariants of the pairs ( ai , a 2 ) and 
(ft, j8 2 ) and is an integral function of the a s and /3 s. Thus Theorem I is 
verified for this case. Note, also, that the rational invariants of / are ex 
pressible as rational functions in the determinantal symbolic invariants, 
(a/3), ( 3 /3), (a/3 3 ),.( 3 &lt;*), (/3 3 /3). 

PART III. RELATION BETWEEN CLASSIC AND MODULAR COVARIANTS 
7. Two kinds of congruences. In the following sections, we shall frequently 
have to distinguish between two kinds of congruences, identical congruences 
and residual congruences. Two polynomials 0i (a;) and &lt;f&gt; 2 (x) , with in 
tegral coefficients, are said to be residually congruent with respect to the field 
GF[p n ] if 0i (a 1 ) = 2 (x) whenever x is any mark of the field; if, however, 
0i (a:) =02 (a:) when x is any number whatsoever, then 0i (x) is said to be 
identically congruent to 2 (a;) . For example, x pn = x is a residual congru 
ence, and ( p + 1 ) x = x is an identical congruence. 

Moreover, if 4&gt;i and 2 are polynomials in two sets of variables, the Xi and 
yi , then we might have a congruence which is residual as far as the x s are con 
cerned, but which is identical as far as the y s are concerned. For example, 
when C is a modular covariant, (3) in 2 is a residual congruence as far as the 
a s are concerned but is an identical congruence as far as the a; s are concerned. 
If, on the other hand, d is a formal covariant, then (3) is an identical con 
gruence with respect to both the a s and the a; s. In each case, however, (3) 
* Madison Colloquium Lectures, p. 42. 

Trans. Am. Math. Soc. 30. 
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is a residual congruence with respect to the s and T? S. Since there is no 
simple notation which will indicate those variables with respect to which a 
congruence is residual, we shall indicate them in words directly after each 
congruence. 

8. Manner in which covariants are transformed. An algebraic covariant, 
C , is usually denned as a function such that its transform, C , is given by the 
formula C = D w C where D is the determinant of the general transformation 
of the group; but it is often defined as a function such that C" = MC where M 
depends merely on the coefficients of the transformation, and the multiplier 
is then proved to be a power of D . Now it has been customary thus far to 
define a formal modular covariant in a manner analogous to the first of these 
methods, but, in this paper, we shall use the second definition and then prove 
it is equivalent to the first. 

Although the definition of a formal modular covariant says nothing explicitly 
about the nature of its transform except for values of the s and 77*3 in the 
field, let us see if this indirectly imposes any restriction on the actual form of 
the transform before we reduce the exponents of the s and 77 s by Fermat s 
theorem. For example, under the transformation (1), LI -= a i" a ai a%" 
is replaced by the same function of the primed letters, which is identically 
equal to 

D! r +1 + Z&gt; 2 af a 2 - D 3 i a" + D t c% +l , 
where 

D! = f m - & rT, D, = ? - ;- , 



It will be observed that each Z); is a formal modular invariant of the two pairs 
(i&gt; ^i) an d (2, 772) when they are transformed cogrediently, and that 
DI s Z) 4 = 0, D 2 = DS = D , when the s and TJ S are all in the field. 

Let C be any formal covariant of the system S . Then, under (1), C formally 
i.e., before any reduction is made modulo p goes into 

(4) C(a ;x ) =2D i (!:,r,)P i (a;x) 

where each D* and each P z is a polynomial in its arguments. If (1) is followed 
by the transformation 

a5 = .-ai+/&lt;a ft =1,2), 

where 

A = ejz - &lt;? 2 /i = (in the field), 

this is equivalent to applying to the original forms of S the single transformation 



* These multipliers, D t , will depend on the covariant, C . When C is L , then the D s are 
as given in the preceding paragraph. 
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where the two pairs (l, r;l) and (&, 773) are obtained respectively from the 
pairs ( 1 , 771 ) and ( , , ^72 ) by applying the transformation of matrix 



(5) 



e 2 \ 

i /J 



Accordingly, if in (4) we replace the a and the x by the corresponding a" 
and x" , and apply the transformation (5) to the pairs ( , 77 ) , then we get an 
equation of the type (4) but formed for the new transformation (I ). Thus, 
before any reduction is made modulo p, 

(4 ) C(a"-,x") =SZMr,r? )P i (a;aO; 

and also 

(6) C(a"; x") = MC(a ; x } (in the field), 

whenever the e s and / s are in the field. Both (4) and (4 ) are ordinary 
equations, but (6) is a congruence in the field, being an identical congruence 
with respect to the a s and x s. Combining these, we have 



as an identical congruence in the s and TJ S. Hence we have proved 

THEOREM II. // C is any formal modular covariant of a system, S , of binary 
forms with respect to the Galois field GF [p n ], of order p n , then under any 
linear transformation (1) of the group, C is formally replaced by an expression of 
the form SD; (, 77) P,- (a; a, 1 ) where each Pi is a polynomial in the a s and x s 
and each D, is a formal modular invariant of the two cogredient pairs (1, 771) 
and ( 2 , 772 ) 

In a later section, we shall prove that each D{ which is not congruent to 
zero when the s and TJ S are in the field is congruent to a power of D (the 
determinant of the transformation) whenever the s and ?? s are marks of the 
field. For isobaric co variants, see Lemma 4 ( 10); for pseudo-isobaric 
covariants, see 16. 

Note that this proof holds equally well for classic covariants, and thus, both 
for classic covariants and modular covariants, the theory of covariants of a 
system of binary forms has its foundations in the theory of invariants of two 
cogredient points. The essential differences between the modular and classic 
theories are, accordingly, associated with two facts: (1) in the classic case, all 
invariants of two cogredient points are polynomials in one invariant, the 
determinant 1*72 771 2; whereas, in the modular case, several other types 
arise; (2) in the classic case there is no invariant, other than zero itself, which 
vanishes whenever the s and rj s are in the field; whereas, in the modular 
case, there are invariants of the two pairs which are not identically congruent 
to zero and yet vanish whenever the s and rj s are in the field. From (1) 
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we see why the transform of a modular covariant is not in general identically 
a multiple of the original covariant, and from (2) we see why the transform of 
a modular covariant in general contains terms having no counterpart in the 
original covariant, before Fermat s theorem is applied. Both remarks are 
illustrated by L at the beginning of this section. 

9. Remarks on isobarism. Another fundamental difference between alge 
braic and modular covariants was mentioned in the introduction and will now 
be discussed briefly. An algebraic covariant is always isobaric i.e., all its 
terms have the same weight,* In fact, an elementary way of determining 
classic invariants of /(a; x} is to write down a general linear combination of 
all possible terms having the same degree in the a s and the same weight and 
then determine what the coefficients of these terms must be in order that the 
expression be an invariant of /.f But a modular covariant is not in general 
isobaric, though the weights of any two of its terms can differ at most by a 
multiple of p" -- 1 . 

These differences are well shown by the algebraic invariants of 

and the formal invariants of the same form, modulo 3 . All algebraic invariants 
of / are polynomials inj 

and the modular invariants are polynomials in A, J , B and F (see 6). It 
will be noticed that of the modular invariants, only one is isobaric and that is 
the one which is also an algebraic invariant of /; the weights of any terms of 
the others differ at most by multiples of p n - - 1 -= 2 . 

ISOBARIC COVARIANTS 

10. Preliminary lemmas. 

LEMMA I. Let C be a formal modular covariant of a system, S , of binary 
forms and a number of cogredient points under a group G with coefficients in the 
Galois field GF [p n ],of order p n . Then, if C is isobaric, it is invariant modulo p 
under any transformation (1) where the % s and rj s are independent variables. 

For, since C is invariant modulo p under the transformation 

To = T 

A 2 ~ ""2 y 



* In the binary form / = a xf + d xf~ [ x 2 + + a m xf , the weight of the coefficient 
en is defined to be i , and the variables Xi and x 2 are assigned the weights 1 and respectively. 
See any standard book on algebraic invariants, such as Elliott, Algebra of Quantics, first edition, 
pp. 36, 48; Dickson, Algebraic Invariants, pp. 31, 38. 

f For examples, see Elliott, pp. 125-126; Dickson, pp. 36-37. 

t Dickson, Algebraic Invariants, pp. 48, 84; Elliott, loc. cit., pp. 98-99. 
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and is isobaric, it is formally invariant, modulo p , under any transformation 

x\ = x( + kx 2 , 
x 2 = kx 2 , 

where k is a general non-zero scalar. Again using the isobarism, it follows 
from this last that C is invariant modulo p under the transformation 

(?) . Xi ==X{ + kx ,, 

X 2 = X 2 . 

Also, C satisfies the condition for covariancy under the transformation 

X 2 == Xi , 

without using Fermat s theorem. Now, using (7) and (8) in conjunction 
with the isobarism, we prove the lemma. 

LEMMA II. // a formal modular covariant is isobaric, then every one of its 
factors is necessarily isobaric. 

For, let Ci and C 2 be any two formal modular covariants which are not 
isobaric. Then the totality of terms in Ci C 2 which have the greatest (least) 
weight is simply the product of the terms of C\ which have the greatest (least) 
weight by the terms of C 2 which have the greatest (least) weight. The lemma 
follows at once. 

LEMMA III. When a formal modular covariant C is isobaric, each of the 
multipliers Di ( , 77 ) of Theorem II is also isobaric. 

For, by Theorem II, under any transformation (1), we have (3) holding 
formally as to the a s, x s, s and rj s. Since C is isobaric, it is invariant 
whenever x\ or x z is multiplied by p, where p is any non-zero scalar. This 
means that the right member of (3) is homogeneous in 1 and 771, and also in 
2 and 772 . Being also invariant whenever we interchange the x s, the right 
member of (3) must be of the same degree in 1 and 771 that it is in 2 and 772 . 
Hence, if u&gt;i,w 2 , w 3 and 4 are the degrees of any term of some Di in 1 , 771 , 2 
and 772 , respectively, then w\ + w 2 = w 3 + w 4 . Similarly, since C is invariant 
formally whenever x( or x 2 is multiplied by p , we have Wi + w s = w 2 + u\ . 
Therefore w\ = w&lt;i and wz == s 

LEMMA IV. When a formal modular covariant, C , is isobaric, each of the 
multipliers Di of Theorem II is either identically congruent to zero or to a power 
of the determinant, D , of the transformation. 

By Lemma III, Z) t is a linear combination of terms of the form * 77""* " ~* 77? , 
where w is constant for a given covariant, C; and thus each D{ which is not 
congruent to zero for every transformation of the group is of the form* 

* For if Di is not congruent to zero when the s and TJ S are in the field, then it must be 
equal to unity for the identical transformation and hence must contain a term in r^ with 
coefficient unity. 
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(9) &lt; ($1172 -171 fe)" + A, 

where A; is divisible by 771 2 and vanishes whenever the s and ?j s are in the 
field. But Di and 1 772 ?7i 2 are both formal modular invariants of the 
two pairs, ( ;, rn) , and are also isobaric; hence A, is likewise. Being divisible 
by ??i and 2 , A; is divisible by L t = ?" 7/ z - ,- r/f (i = 1 , 2 ) . By Lemma 
II, this is impossible unless A; is identically zero in the field. Finally, each 
Di which is congruent to zero whenever the s and rj s are in the field is of the 
same type as the A; above. Thus the lemma is proved. 

11. Isobaric seminvariants in terms of the roots. If a formal modular 
seminvariant / of a system, S , of binary forms with respect to the field GF [ p n ] 
is isobaric, then, by Lemma I, it is invariant, modulo p, under any trans 
formation 

a-i = x( + Mx 2&gt; 
i 

3-2 = X 2 , 

where M is a general scalar. Hence, as in the theory of algebraic semin 
variants, we can prove 

LEMMA V. Let I be any isobaric formal modular seminvariant of a system, 
S , of binary forms with respect to the Galois field, GF [p n ], of order p n u hich is 
of degree d and weight w . Then I is congruent, modulo p , to the product of a Q by 
a symmetric polynomial, S , in the symbolic ratios 0:2/0:1, fa/fii, which is 
homogeneous in these ratios of degree w . Moreover, S is expressible as a poly 
nomial in the differences of these ratios. 

The proof proceeds as in the theory of classic seminvariants, but (for the 
sake of completeness) we shall reproduce it for the case when S contains only 
one form. Since I is isobaric, it can not contain o as a factor, by Lemma II. 
Hence / is a^ multiplied by a polynomial, S , of degree d in OI/OQ , a 2 /a , , 
Om/ao . Thus S is a polynomial in the elementary symmetric functions of the 
symbolic ratios 

, Oil $1 72 

"- - ~ ~a~ ~ &gt; &gt; 
ai Pi 71 

and is of degree d in each such ratio; and since I is isobaric, S is homogeneous 
in these symbolic ratios of total degree w . 

But, since / is formally unaltered, modulo p, under transformation (10), 
it is formally unaltered, modulo p, when the symbolic ratios are diminished 
by any scalar, M . Accordingly, set 



and similarly with the other ratios; and thus 



Oil 
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where each Ai is a polynomial in the differences such as (182/181) ( 2 /ai), 
(72/71) " (2/i), . As in classic invariant theory, this is impossible 
unless the Ai (i g 1) are all zero in the field. Hence the lemma is proved. 
The converse is also readily proved, but will not be needed here. 

12. Isobaric invariants in terms of the ratios. When I is an isobaric formal 
modular invariant of a binary form,/, then (by Lemma I) its transform under 
any transformation (1) when the s and rj s are general scalars is connected 
with 7 by the relation (2), where (2) is an identical congruence as to the a s 
and thus is essentially of the same type as the corresponding relation for an 
algebraic invariant of/. Hence, as in the theory of algebraic invariants, we 
can prove for invariants the result which is a generalization of Lemma V. 
A similar argument obtains for an invariant of a system of binary forms and 
hence for covariants of a system of forms.* Thus we have 

LEMMA VI. Let C (a formal modular covariant of a system S of binary forms, 
/;) be of degree di in the coefficients of fi and of degree d in the variables, Xi and x 2 , 
and let it be isobaric of weight w . 

I. Then C is identically congruent, modulo p , to LTa o 1 x% multiplied by a linear 
combination, K, of products of the expressions (.Ti/.r 2 ) + (a 2 /ai) , and of dif 
ferences of the type (ft /ft) -- (a 2 /a:i), where the a s and j3 s may be symbols 
arising from the same form, f it or may be symbols arising from two different forms. 

II. Moreover, K is homogeneous in the symbols for each form, /, and such 
that each ratio a^/aifor this form, fi, occurs in exactly d^ factors in each product. 

III. Finally, K is symmetric, modulo p, in these symbols for each form /;. 

In applying the results of 11 and 12, the reader must remember that, 
although the seminvariant (or invariant) is always symmetric in the pairs 
(0:1, a 2 ) , (ft, /3 2 ) , in the form in which it first appears, yet the poly 
nomial, K, in the differences (ft /ft) ( 2 /i) is not in general symmetric 
in the ratios 2 /ai, ft/ft, , but is merely symmetric modulo p. For the 
binary quadratic / = o -?i + a i x i #2 + #2 x\ = a x j3 x has the formal modular 
invariant a x = i ft + 2 ft = ai ft [ (ft /ft) - ("2/0:1) ], modulo 2. The 
first is actually symmetric; but the second is not symmetric, though it is 
symmetric modulo p = 2 . 

13. Isobaric formal covariants as algebraic covariants. From Lemma VI 
it is evident that, if we express a for each form, /; , in terms of the symbols, 
we have 

THEOREM III. Every isobaric formal modular covariant, C , of a system of 
binary forms is expressible as a polynomial in symbolic covariants of the type 
a x = ai xi + 2 o-o and in determinantal symbolic invariants of the type 

* For every formal covariant of the system S is a formal modular invariant of the system 
S consisting of the forms of S and the additional linear form I = -X" 2 Xi Xi x 2 in which 
Xi , X 2 have been replaced by Xi,x 2 , respectively. The proof of this is the same as for algebraic 
covariants. 



300 o. c. HAZLETT [December 



But it is well known that the product, P , of .TO Xiao 1 by any expression, K , 
in the symbols satisfying the conditions I and II of Lemma VI is an algebraic 
covariant of the system S of forms, though not necessarily a rational one.* 
A necessary and sufficient condition that P be rational in the coefficients of /, 
is that K be actually symmetric in the ratios /3 2 //3i , a 2 /ai , formed for /, . 
This is stronger than condition III of Lemma VI, as is illustrated by the 
symbolic invariant (a/3) for the binary quadratic, mod 2 (see 12). The A th 
power of (a/3) is symmetric, mod 2, for any k; but is actually symmetric 
only when k is even. Thus we have 

COROLLARY 1. Every isobaric formal modular covariant of a system, S, of 
binary forms with respect to the Galois field, GF[p n ] of order p n , is congruent, 
modulo p , to an algebraic corariant of S , either rational or irrational. 

But every irrational algebraic covariant of S is known to be a root of an 
algebraic equation whose coefficients are rational, algebraic co variants of S , 
and thus we have 

COROLLARY 2. Every isobaric formal modular covariant of S is congruent, 
modido p , to a root of an algebraic equation whose coefficients are rational algebraic 
covariants of S . 

PSETJDO-ISOBARIC COVARIANTS 

14. Informal discussion. In the preceding sections, we have considered 
formal modular covariants that are actually isobaric and have seen that, in 
many ways, such a covariant is much the same as an ordinary classic covariant. 
But if a formal modular covariant is not isobaric, what then? For example, 
consider the formal modular invariants of a point with respect to the field 
GF [p n ]. Any such invariant, / , whose degree, d , is less than p n + 1 must 
be an ordinary invariant of the point. For, since the weights of any two 
terms would have to be congruent, mod p n -- 1 , / would have to be of the 
form Ax p i n ~ l + Bx%"~ 1 where A and B are constants. But, if the degree is 
&lt; p n , there is no opportunity to apply Fermat s theorem to the coefficients 
of the transformation, and thus / must be invariant in the classic sense and 
therefore identically zero in the field. If p n &lt; d ^ p 2n , it can be shown by a 

* Dickson, Algebraic Invariants, p. 55, ex. 7; Elliott, loc. cit., p. 93. (Some people do not 
agree with my statement, and (in justification) say that P in general has no actual meaning 
in the o s. But any polynomial, P , in the symbols which is not actually symmetric in the 
symbols, i , , 0i , /3, for a particular form, f { , is a root of an algebraic equation whose 
coefficients are the elementary symmetric functions of P and the other symbolic invariants 
obtained from P by interchanging any two pairs of symbols for this form/,; and hence P is a 
root of an algebraic equation whose coefficients are polynomials in the coefficients of/,- . Simi 
larly, P is a root of an algebraic equation whose coefficients are polynomials in the coefficients 
of S , and thus P is an algebraic function of the coefficients of S , though it is not necessarily 
an explicit algebraic function of these coefficients. Hence why is it not an algebraic covariant 
of S?) 
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similar argument that either I is an ordinary invariant of the two cogredient 
pairs, (xi, x 2 ) , ( zf , x% ) , and therefore I = kL where k is a constant and 
L = rcf x-i x\ x^; or LI is an ordinary invariant of the two cogredient 
pairs (&gt;!, z 2 ) and (a* 2 ", zf"), and therefore LI --=k(xf n x 2 -Xixf"). 
In fact, it is well known that all formal modular invariants of the point ( x\ , x z ) 
are polynomials in L and Q = (xf n x z x\ xf")/L. This might be stated 
more strikingly by saying that every such invariant is a rational, integral 
algebraic invariant of three points, or is a quotient of two such invariants. 
Moreover, a similar statement is true of the formal modular invariants of a 
pair of cogredient points, (x\, x 2 ) and (y\, y z ) .* Is this true more generally? 
15. Preliminary lemmas. Consider a formal modular invariant, 7 = 2Z/j , 
of the binary form /(a; x) which is of degree d and which is not isobaric. In 
particular, let 7o be a term of I whose weight, WQ , is less than or equal to that 
of any other term of / , and let ,/i be any other term of I . Then, since I is 
pseudo-isobaric, the weight of 7i will be w\ -- WQ + l(p n -- 1 ) where I is an 
integer, positive or zero. When the coefficients of /(a; x} are replaced by 
the corresponding functions of the symbols a\ , a 2 , j8i , /3 2 , , 7 is homo 
geneous of degree md in these symbols. Also, if /_, is of weight w,, it is of 
degree md Wj in symbols with subscript 1 and of degree w, in symbols with 
subscript 2. Their difference, md 2wj , we shall call the excess, as in classic 
invariant theory. 

LEMMA VII. The excess of any term of a formal modular covariant is con 
gruent to zero, modulo p n -- 1 . t 

For, by interchanging the subscripts 1 and 2 throughout 7, it follows that 
7 contains a term whose degree in the symbols with subscript 1 is Wj and whose 
degree in the symbols with subscript 2 is md W,, and therefore of weight 
md Wj. Hence md Wj = iv, (mod p n 1 ) , and the excess of any term 
of 7 is a multiple of p n - 1 , say Ej ( p n - 1 ) - 

Now I p " is identically congruent in the field to ]C7 ; j n ,t and thus the latter 
is a formal modular invariant of /(a; x) . It may also be regarded as an in 
variant of f(a pn ; x) , but we shall show that it is better for our purpose to 
regard it as a simultaneous invariant of /(a; x) and/(a p "; x) . 

For in 7 P " the term 7{f is of excess E p n (p n -- 1 ) , and hence, if we replace 
E (p n 1) factors of the form a 1 ? by as many factors of the form en, the 
excess of the resulting symbolic expression is E (p n -- 1 ) . If w = E , then 
by replacing E additional factors of the form of by as many factors ai, 

* W. C. Krathwohl, Modular invariants of two pairs of cogredient variables, American 
Journal of Mathematics, vol. 36 (1914), pp. 449-460. 

t After finishing this MS., I discovered that Professor Glenn stated and proved this in 
Concerning an analogy between formal modular invariants and the class of algebraic invariants 
called Booleans, American Journal of Mathematics, vol. 37 (1915), p. 75. 

J Dickson, Linear Groups, p. 15. 
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the excess of the resulting symbolic expression is zero. 7*f is now of degree 
E p n in a s having subscript 1 and of degree zero in a s having subscript 2; 
also, it is of degree ( WQ E )p n in a s having subscript 1 and of degree wg p n 
in the a s having subscript 2 . Here and elsewhere in this work, we use the 
expression "symbols a s" to denote any and all symbols such as ai , a 2 , Pi , 71 , 
; similarly with the expression "symbols a." 

Moreover, if W Q = EQ , the same device will serve to make all terms of I pn 
of excess zero and also to make all terms of the same weight, w? . For the 
weight of any term, /, of / is ic + k(p n -- 1) and hence is of excess 
("0 2/o) (p n -- 1 ) which we shall assume positive for convenience. Accord 
ingly, if in If we replace ( E 2k ) ( p n ) factors of the form a /" by as many 
factors ai , the result is of excess zero, but of weight [ + k ( i&gt;" -- 1 ) ] p" . 
Next replace kp n factors af by as many factors a\ and replace kp n factors 
OL P Z by as many factors a 2 , and we get an expression whose excess is zero and 
weight is WQ p n . This is legal, since WQ E k + 1 and w = A . In case 
the excess of / is negative, interchange the subscripts 1 and 2 in the foregoing 
by an argument similar to that used in Lemma VII. Thus the remark at the 
beginning of this paragraph is proved. 

For convenience, we shall speak of the a s, /3 s, (the symbolic coefficients 
of the linear factors a x , j3 x , of /(; .r)) as the primary symbols; and we 
shall speak of the totality of the original symbols cci, a 2 , /?i, j8 2 , together 
with the new symbols ai , a 2 , /3i , /? 2 , , and any other symbolic coefficients 
introduced in the above manner as the secondary symbols. 

We see, accordingly, that every pseudo-isobaric formal modular invariant, 
7, of /(o; x) is such that a suitable power of 7 is isobaric in the coefficients of 
the symbolic linear factors of / ( a; x ) and of/ ( a pn ; x ) , provided ^ WQ S Eg . 
Even if ^ w &lt; E , this argument shows that the excess of I p " in the 
secondary symbols is (E o) (p n -- 1) less than that of the original 7. 
But, since the excess is finite, it follows that, by repeating this process at most 
[Eo/wo]* times, we prove Lemma VIII for the case WQ ^ 0. It will be ob 
served, however, that for this purpose we have to introduce another set of new 
symbols, say aj , a 2 , ft , j3 2 , , and possibly still other sets having three or 
more dashes. These, together with those introduced above, we group under 
the heading "secondary symbols." 

When w = 0, we first multiply 7 by any invariant which is not an absolute 
invariant and then proceed as above. Thus we have 

LEMMA VIII. Let I be a pseudo-isobaric invariant of a binary form 

f(a;x) = lLa x 

with respect to the Galois field, GF [p n ], of order p n ; and let the lowest weight of all 
* Here [ E O /WO ] means the smallest integer which is not less than E O /WO . 
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the terms of I be denoted by w . Then, if w ^ , some finite power of I is con 
gruent to a polynomial in the secondary symbols for f which is isobaric. If 
iv = , then some power of I is congruent to the quotient of tico polynomials in 
the secondary symbols each of which is isobaric For examples, see Part IV. 

16. Pseudo-isobaric modular covariants as algebraic covariants. Let J de 
note that formal modular invariant of/ which is obtained as a result of applying 
the method of Lemma VIII to the pseudo-isobaric invariant / and which is, 
accordingly, isobaric of weight w in the secondary symbols. If we apply to 
J the transformation (1), then the transform J = J ( a ) is identically equal to 

(12) J(a ;a ;a ; )== Eft (, r?) P k (a; a; a; ) + r 



where each Dk (i) is a formal modular invariant of the two pairs (1, 77 j) and 
( 2 , f?2 ) , ( ii) is congruent to zero or to D w whenever the s and ry s are all in 
the field, (iii) is isobaric of weight w in the s and ij s; and where r is a poly 
nomial in the s and rj s and in the secondary symbols which vanishes whenever 
the dashed symbols are replaced by the proper powers of the corresponding 
primary symbols. That is, J is not in general an invariant function of the 
secondary symbols when these symbols are taken as independent, but it is an 
invariant function when the dashed symbols are replaced by the proper ex 
pressions in the primary symbols. 

Since J is isobaric in the secondary symbols, Lemmas I-IV apply to J . If 
we proceed to Lemma V, we see that the proof applies to J with the exception 
that, since J is not in general rational in the coefficients of f (a; x~) ,f(a p "; x) , 
f(a p *"; x) , , then J is not in general congruent (modulo p) to the product 
of y by a symmetric polynomial, S , in the secondary symbolic ratios; but, 
nevertheless, J is congruent (modulo p) to a product of secondary symbols 
with subscript 1 by a polynomial, S , in the secondary symbolic ratios. Hence, 
as before, S is expressible as a polynomial in the differences of the secondary 
symbolic ratios. Thus the proof of Lemma VI applies to J with the restric 
tions noted. 

Hence we prove Theorem III and its corollaries for J , and we have 
THEOREM IV. // / is a formal modular covariant of a system of binary forms 
with respect to the Galois field GF [ p n ] of order p n , then I q (where qisa sufficiently 
large integer) is congruent to one of the two folloiving forms : 

(1) a polynomial in symbolic covariants of the types a x , a x , A x , (a/3) , (Aj3) , 
etc., where X = x p " and A = a p "; 

(2) a quotient of two polynomials described in (1). 

Since these symbolic expressions are (symbolic) algebraic covariants of 
/(a; x) ,f(a pn ; x) , etc., we have 

COROLLARY 1. Every pseudo-isobaric formal modular covariant, C , of a 
system S of binary forms with respect to the Galois field GF [ p n ] is congruent, 
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modulo p , to a symbolic algebraic covariant, C , of the forms of S and certain 
related forms obtained from the forms of S by replacing each coefficient of these 
forms by its p n th power. 

In particular, this means that the function J may be chosen in such a way 
that, under the transformation (1), &lt;7 goes into J , where J is equal to (not 
merely congruent to) D w J , when J is expressed as an isobaric function of the 
secondary symbols. Note, however, that J is not necessarily rational in the 
coefficients of the forms of S and the related forms. These remarks are 
illustrated by examples given in Part IV. 

17. Finiteness of determinantal symbols. In the preceding section, we 
proved that all formal modular invariants I of a binary system, S , are con 
gruent, modulo p , to rational functions in symbolic invariants of S of a certain 
special type called determinantal symbols. That is, I is congruent, modulo p , 
to a rational function of symbolic invariants of the types ( a/3 ) , ( a p " a. ) and 
symbols obtained from these by replacing a pair of symbols by then- p n th 
powers one or more times. Conceivably, as the degree of / increases, the 
degree of the requisite determinantal symbols might increase without bound, 
so that we would need an infinite number of determinantal symbols to obtain 
all formal modular invariants of S . 

But, in Part II, we proved that all formal modular invariants of S are con 
gruent to polynomials in a finite number of symbolic invariants which are 
formal modular invariants of the (symbolic) coefficients of the symbolic linear 
factors of the forms of S . Now apply Theorem IV to these symbolic in 
variants, and we see that all symbolic invariants of S are congruent to rational 
functions of determinantal symbols. Hence we have 

THEOREM V. All formal modular covariants of a system of binary forms are 
congruent, in the field, to rational functions of a finite number of symbolic Co- 
variants of the types described in Theorem IV. 

In fact, this theorem could have have been proved directly in much the 
same manner as Theorem IV was proved and then Theorem IV would have 
followed as a corollary of Theorems I and V. 

PART IV. EXPRESSION or COVARIANTS IN SYMBOLIC FORM 

18. Method for algebraic covariants. In the symbolic theory of algebraic 
covariants, after proving that every such covariant is expressible as a poly 
nomial in symbols of the two types, ( a/3 ) and a x , there is then given a clear- 
cut method for expressing any covariant in terms of these symbolic covariants. 
For completeness, this method will be reproduced here, as we shall need to 
refer to several of the results. 

Let I be any algebraic invariant, of weight w, of a system of forms whose 
coefficients are the a s; and let I (a ) be its transform under (1). Under this 
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transformation, a\ and o? 2 go into a { and a,, respectively; and similarly for 
the j3 s, 7 5 s, etc. Just as the a s are polynomials in ai , a 2 , /?i , j3 2 , , so 
their transforms are polynomials in or f , a,, /8 f , /3,, . Hence, in the equa 
tion (2), 

( i ) replace every a by the corresponding polynomial in a f , a, , etc. ; 
(ii) operate on both sides of the resulting equation w times with 

V = : 



(iii) divide both sides of the result of (ii) by (w + 1 ) ! wl . 
From the left we get a polynomial in the symbolic invariants of the type 
( a/3 ) , and from the right we get 7 ( a ) . 

In proving this, use is made of the following: 

LEMMA A. 

= ( W + 1)1 W l. 



LEMMA B. The result of operating by V w on a product of k terms of type a { 
and I terms of the type /?, is a sum of terms each containing k w factors a% , 
I w factors j8, and w factors ( cn/3 ) . 

19. Difficulties in modular case. It is at once evident that this method is 
not applicable directly to formal modular invariants, since the divisor in ( iii ) 
is congruent to zero in the field whenever w = p I . This difficulty, of 
course, might be superficial in the sense that it might be due to some accidental 
peculiarity in the proof and not to any essential characteristic of the problem; 
but, as will be apparent presently, there is a deep-seated difficulty. 

Lemma B is proved by direct verification combined with induction, and 
thus is lost the full significance of the reason why it is true. Now each symbolic 
expression a% and each /3, is an invariant of the two points ( 1 , 2 ) and ( rji , 7/ 2 ) , 
which are transformed cogrediently when the original variables are subjected 
to any linear transformation 



Xj = KXi 2 , 

(13) A = KV \n 9^ 0, 

X Z = fiXi + VXz , 

thus replacing (1) by 

xi = & x[ 



where the s and rj s are cogredient with the x s. Moreover, V is an invari- 
antive operator whenever the s and T? S are transformed cogrediently. Lemma 
B is an inevitable consequence of these facts. 

If we look at Theorem II ( 8), we see that in the modular case we have 
the identity (4) where each Z),- is a formal modular invariant of (|i , r?i ) and 
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( 2 , 772) when they are transformed cogrediently; but Di is not in general a 
formal modular invariant of the two pairs (1, 2 ) and (771, 770), though it 
is clearly a modular invariant of these pairs. Moreover, the only way to 
operate on a product of a number of symbols of the type $ and /3, in such a 
way as (i) to eliminate the s and TJ S and (ii) to obtain as result a formal 
modular invariant of the as, jS s, etc., is to use an operator which has the in- 
variantive property when ( x , 2 ) and ( 771 , 770 ) are transformed cogrediently. 
Now the natural analogues of V for the formal modular case are, by Theorem 
II, those formed on 

1" 772 771 2 &gt; etc., 

as models i.e., by replacing each by the operator indicating partial differ 
entiation with respect to that , and similarly with the TJ S but they can not 
give the desired results since these operators are not invariantive when the s 
are transformed cogrediently with the 77 s. 

20. Significance of results of Part III. Such is the essential difficulty of the 
problem and such is one of the reasons for proving the theorems of Part III. 
There it is shown that ( i ) if C is isobaric, then each Di is an algebraic invariant 
of the two pairs ( 1 , 771 ) and ( 2 , rj 2 ) and hence is an invariant of the two pairs 
(ii 2) and (TJX, 772); (ii) if C is not isobaric, then a suitably high power of 
C , say C q , is such that, when (2) is formed for C Q , then each Di is an algebraic 
invariant of the two pairs ( 1 , 771 ) and ( 2 , 772 ) and hence is an invariant of 
the pairs ( j , 2 ) and ( 771 , r? 2 ) . In view of the remarks of 19, these results 
smooth out some of the difficulties. 

21. First method. Let I be any formal modular invariant of a system S of 
binary forms with respect to the Galois field, GF [p n ], of order p n , and let J 
be a formal modular invariant of S which is obtained from 7 by applying the 
method of Lemma VIII and which is isobaric in the secondary symbols. The 
first method of expressing J in symbolic form is suggested by the proof of 
Lemma V (11). Although it lacks originality, it has the advantage that it 
leads at once to a result without exception or qualification. If J is of degree 
di in the coefficients of / , divide J by iraf where the a\ range over all symbols 
for fi having subscript 1 . Do this for every form /,- . The quotient, S , is 
a polynomial in the ratios 0.1 /a\, /3 2 /|8i, , 0:2/0:1, with coefficients which 
are marks of the field. Single out any particular ratio of the primary symbols, 
say az/ai, and select all terms of S which do not contain 0:2/0:! as a factor. 
.In the resulting expression, replace /3 2 //3i by (/3 2 //3i) (a 2 /ai), af/a C by 
(a"/ a D " (#2/0:1), an d similarly for all symbolic ratios. The resulting 
expression, E , is clearly an invariant, since it is a polynomial in the determi- 
nantal symbols. Moreover, it is identically congruent, modulo p , to J , for it 
is simply A in equation (11). 
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22. Second method. It is usually easy to determine by inspection a product, 
TT , of determinantal symbols in such a way that the terms of TT which have 
the greatest excess are among the terms of J which have the greatest excess; 
and hence, by a finite number of steps, we can express J in symbolic form. 
No attempt has been made to crystallize this into a definitive, systematic 
process, though possibly this might be done. 

For example, consider a xl + 2ai xi .r 2 + a 2 xL Modulo 3, this has the 
formal modular invariant 



B = ai ((?! o ) (i + o ) (2o + 02) (2i + 2 ) (01 + er 2 ) 

= (aift + 2ft) (ift + a(3 L + a^ft) (on ft + 2 ft - aift) 

X ( - i ft + 2 ft ) ( i ft + 0:2 ft + Q!2 ft ) ( ai ft + a 2 ft - 0*2 ft ) 
= L? /3? /3i - La? ft $ + Laf /3? at 131 

- Z? # i 0! + La? 2 & - La? i 2 

= L( 3 /?) 2 (/3 3 /3) (mod 3). 

The terms of greatest excess in B are La? 0? (82 which suggests that we use to 
"kill off" these terms L ( 3 |8) 2 (|3 3 /3), since the only way to express af/3?|8i 
in the secondary symbols i , a 2 , ft , ft , a? , ai , jS? , /Si in such a way that it 
shall be isobaric in these secondary symbols is a? ft a? ft /3 s ft Thus we take 
as TT, (o 3 ^) 2 (/3 3 /3). It is then readily discovered that L (a 3 /3) 2 (/3 3 /3) 
accounts for all terms, modulo 3 . 

23. Third method. The preceding method is closely related to the one 
about to be explained. Although the second method has none of that gener 
ality which attracts the artist in pure mathematics, yet it has the advantage 
in any concrete case, while the present method is to be preferred in any general 
argument. 

Under transformation (1), / is carried into 

(15) J(a")==D w J(a) (mod p) 

where this is an identical congruence in the a s and the coefficients of the 
transformation. Moreover, it is understood that J (a 1 ) is so written that it is 
isobaric of weight w in the secondary symbols as in Lemma VIII. Thus, if 
each a is replaced by its expression in terms of the a^ and a,, a^ and a,, etc., 
the left member of (15) is the sum of a number of terms each of which is a 
product of w linear functions of the s and w linear functions of the ij s. 

Now operate on both sides of (15) with V w as given in (ii), 18. Since 
the left member is isobaric in the secondary symbols, we can apply Lemma B. 
Hence, from the left member we get a sum of a number of terms, each of which 
is a product of w determinantal symbols such as ( a(3 ) , ( a pn ft ) , ( a p " a ) , etc. 
From the right we get ( w + 1 ) ! w I J ( a ) . Call the result equation E . 
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If(w + l)!w!sO( modulo p ) , then the totality of those terms on the 
right of equation E whose coefficients are not congruent to zero, modulo p, 
constitute a syzygy, modulo p. Moreover, since the right member of (15) 
is symmetric, modulo p, in the primary symbols, this syzygy must be sym 
metric, modulo p , in these symbols and hence congruent to a function which is 
rational in the coefficients of the ground forms, / . Hence, if we know the 
necessary syzygies or use discretion, we can replace equation E by one (i) 
which is equivalent to E and (ii) which has each coefficient congruent to 
zero, modulo p . If we now cancel p from both sides of this equation, we have 
an equation, EI, of the same type as E, but such that the multiplier on the 
right is not divisible by so high a power of p . Now treat equation E\ as we 
treated E above. These steps are all justified, since / is an algebraic invariant 
of certain forms. 

24. Illustrations. A fundamental system of formal modular invariants, 
modulo 3 , of the binary quadratic was expressed in symbolic form in 6. 

It is known that a fundamental set of formal modular invariants, modulo 2 , 
of the cubic 

do x\ + 01 x\ x z + 2 Xi x\ + a 3 x 3 &gt; 

= Tra x = ( i .TI + &lt;*2 #2 ) ( ft .TI + ft x 2 ) ( Ti #1 + 72 X 2 ) 
is 

K = cfi + a 2 , A = a as + i 02 , I = al + a K + 8 00 , 

k = a 6 00 , g = $ + j8 ( A + A 2 ) + ( A + 6 00 ) (/3 + a K + K 2 ) 

where 

J3 = o i + al, 



After a little computation, it is readily seen that 



q j 7 | , A 
"(o 2 )(/3 2 ^)(7 2 7) 
fc= ( 2 a)(/3 2 /3)(7 2 7). 

But every attempt to express K as a polynomial in the determinantal symbols 
is bound to fail, since it is not possible to express K as an isobaric function of 
the secondary symbols. In fact, K is the invariant which forced the author 
to realize Theorem IV and the lemmas that lead thereto. It is easy to show 
that 

K 2 = E (a 2 7) ( 2 7) + (a/3) 



Similarly, it is not possible to express g as a polynomial in determinantal 
symbols, though (by Theorem IV) a power of g is so expressible. 
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The author has also expressed some covariants of these two forms in symbolic 
form, but the amount of numerical work necessary to do this seemed to increase 
rapidly with the degree of the covariant called J in Part III. In short, the 
author would not recommend that the theory of Part III be applied in detail 
to any but the simplest concrete case, unless there should prove to be some 
powerful systematic procedure not yet discovered. 

PART V. FINITENESS THEOREMS 

25. Preliminary. In Part III, we saw that every formal modular covariant 
of a system, S , of binary forms with respect to the Galois field GF [p n ], 
of order p n , is expressible as a polynomial in a finite number of symbolic Co- 
variants. Moreover, if the covariant is rational and integral, it must be such 
that when expressed in terms of these symbolic covariants it is unaltered, 
modulo p , when any two pairs of primary symbols, ( a v , a 2 ) and ( /3i , /3 2 ) , are 
interchanged. It must be carefully noted, however, that this is not equivalent 
to saying that it is symmetric in the symbolic covariants. 

Using this result, we can prove that the set of all formal modular covariants 
of S which are rational and integral possesses the finiteness property i.e., 
they are all expressible as polynomials in a finite number of these covariants. 
The proof given below is Hilbert s proof of the finiteness theorem for algebraic 
covariants, with only one slight modification. The theorem is proved for 
invariants of a general binary system, S , since the covariants of a system S are 
coextensive with the invariants of another system S . 

It is readily seen that Hilbert s beautiful proof,* when shorn of non-es 
sentials, depends entirely on the following properties of algebraic invariants: 

I. All invariants are polynomials in a finite number of polynomials, P , in 
a certain finite set of pairs of auxiliary variables. Since these polynomials, P , 
are invariants under the group, we shall call them elemental invariants. In the 
case of algebraic invariants, these auxiliary variables are the coefficients of 
the symbolic linear factors of the forms of 5 and the elemental invariants are 
the determinantal symbolic invariants of type ( a/3 ) . 

II. If the invariant is rational and integral, it must be such that, when 
expressed in terms of the elemental invariants, it is unaltered (with respect 
to the field of definition) when any two pairs of the auxiliary variables are 
interchanged. 

His proof uses the following theorem about diophantine equations: A system 
of any number of linear diophantine equations has only a finite number of 
simple sets of solutions, all other solutions being sums of positive multiples of 
this finite number of sets of solutions. 

* Ueber die Endlichkeit des Invariantensystems fur Unare Grundformen, Mathema- 
tische Annalen, vol. 33 (1889), pp. 223-226. 

Trans. Am. Math. Soc. 21. 
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Now Part II proves that these two properties hold for formal modular 
invariants where the auxiliary variables are the primary symbols a.\ , 2 , &\ , @z , 
and where the elemental invariants are the finite number of symbolic 
invariants of Theorem I. Moreover, Hilbert s proof makes no use of properties 
peculiar to a field of characteristic zero except in one place which is readily 
avoided. Thus his proof applies here. 

20. Finiteness of covariants. For convenience, let 7i , 1% , , 7 M be a 
fundamental set of symbolic formal modular invariants of S . Then every 
formal modular invariant of S is expressible in the form 

(16) / == J/,- 7, s /? /;" 

where the exponents, Sj , are positive integers or zero and the coefficients, J7, , 
are marks of the Galois field, GF[p n ] . Necessary and sufficient conditions 
that 7 be a rational and integral function of the coefficients of the forms of S 
are that ( i ) when the right member of (16) is expanded, it is a function of the 
pairs (oil , 2 ) , (/3i, /3 2 ) , which is symmetric, modulo p, (ii) each term 
of this expansion contains the same number of a s as /3 s, the same number of 
/S s as 7 s, etc. 

If dj (a) denotes the degree of 7, in the pair ( ai , a 2 ) . then the last condition 
gives a set of linear diophantine equations of which two are 

d = Si dt ( a ) + * 2 cfo ( a ) + + s lt d lt (a) 
= *i(M/3) +*2&lt;M/3) + +*&lt;*()&gt; 

there being one such equation for every pair of symbols. By the theory of 
linear diophantine equations, it follows that every rational integral invariant, 
/, is expressible in the form 

(18) / == ZNiC sC j-- C /, 



where each C{ is a symbolic invariant of S which has the property (ii) above 
and where (iii) there is one (7, corresponding to each simple set of solutions 
of the set of diophantine equations (17), and (iv) the d are the same for all 
invariants, 7 , of the system S . 

Moreover, if cr ( - is the number of the symbolic invariants which are not 
identically congruent, modulo p, and which are obtained from a particular 
d by interchanging any two pairs of symbols, then d satisfies an algebraic 
equation of degree o-; where the coefficients of the various terms are (aside from 
sign) the elementary symmetric functions of d and its conjugates. Since d 
and therefore each of its conjugates has the property (ii) , these coefficients, 
Kj, are formal modular invariants of S w y hich are rational and integral in the 
a s. Hence 7 may be written 

(19) 7 = Y.Lj (EQi C{ Ci C /) (0 i tt &lt; o-,) 
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where the Q s are in the field and where the summation in the parentheses is 
symmetric, modulo p , in the pairs of symbols ( i , a 2 ) , (ft , /3 2 ) , etc. More 
over, the Lj are polynomials in a finite set of rational, integral formal modular 
invariants, viz., the K s. 

Hence by referring to the sufficient conditions given above that a symbolic 
invariant be rational and integral, we have 

THEOREM VI. Let S be a system of binary forms with coefficients which are 
independent variables and let G be the total group of linear transformations on the 
variables whose coefficients are marks of the Galois field, GF[p n ], of order p n . 
Then all formal modular covariants of S under the group G are expressible as 
polynomials in a finite number of such covariants. 

"21. Finiteness of syzygies. By using, as in classic invariant theory, 
Hilbert s useful theorem about an infinite sequence of polynomials, we can 
show that every syzygy, S == 0, among the covariants of the system S is of 
the form 

S = 



where (i) the H, are formal modular covariants of S; 

( ii ) the Sj are polynomials in the formal modular covariants such that 
Sj = is a syzygy among the formal modular covariants of S; 
( iii ) the Sj are finite in number and are the same for all syzygies. 
Thus we have 

THEOREM VII. The syzygies among the formal modular covariants of the 
system S possess the finiteness property in the sense that all of them are con 
sequences of a finite number of them, Si = , S 2 , , Sk = . 

MOUNT HOLYOKE COLLEGE, 
SOUTH HADLEY, MASS. 



ON THE MEAN-VALUE THEOREM CORRESPONDING TO A GIVEN 
LINEAR HOMOGENEOUS DIFFERENTIAL EQUATION* 

BY 

G. POLYA 
I consider the linear homogeneous differential expression of order n 

Lf(x) s /&lt;-&gt; (x) + fa (*)/&lt;- UO + fa (*)/ ( "- 2) (x) ++*. (*)/(*) 

I suppose fa (x) , fa (x) , , 4&gt; n (x) are continuous, /(.?) is differentiate 
n times, the variable x and all the functions in question are real. I then prove 
some theorems concerning the equation 



which are, I believe, new, but there is one special case of them which belongs to 
the most classical part of the differential calculus. This is the case of the 
simplest equation of the form (1), that is 

5?-- 

Denote the Wronskian 



/*(*) 



by W(fi (x) , / 2 (x) , , f n (.! )) The equation (1) possesses in certain 
intervals the 

Property W . There exist n -- 1 integrals hi (x) , hi (x) , , h n -i (.c) of 
the homogeneous equation (1) satisfying the n 1 inequalities 

(3) /h(z) &gt;0, JF(M*), Ms)) &gt; 0, 

,PF(Mz),MaO, ,/*-iU)) &gt;0 

throughout the open interval (a, b~) . 

Determine n -- 1 particular integrals of (1) by such initial conditions that 
the n -- 1 Wronskians in question are = 1 for x = XQ; then the inequalities 
(3) are satisfied in a sufficiently small interval surrounding the point XQ . 

* Presented to the Society, October 27, 1923. 
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Equation (2) possesses the property W in the interval (-, + &gt;) since 
the functions 1 , .r , .r 2 , , x n ~ 2 are integrals of (2) and we have 

1 &gt; 0, (!,. r) = 1! &gt; 0, W(l,x,x 2 ) = 112! &gt; 0, -. 
The equation 
(4) y" + y = 

is of the second order. Thus there is only one inequality (3), which is satisfied 
by the integral sin ( x a ) for a &lt; x &lt; a + TT . No integral of the equation 
(4) is &gt; throughout an interval (a, b) where b a &gt; TT . Therefore the 
property W belongs to (4) in any interval of length S TT but in no interval of 
length &gt; TT . An equation y + 4&gt;y = of the first order possesses the property 
W in any interval where is continuous. 

HV assume now (and also in 3-5) the existence of the property W . Then 
we have the following theorems I, II, III: 

THEOREM I. // the function f (x) , supposed differentiate n times, vanishes 
at n + 1 points of the interval (a, b) , then there exists an intermediate point % 
such that 

/() = j* (0 + *i ()/&lt; -"(?) + +*(?)/() = 0. 

Theorem I is a generalization of Rolle s theorem. Apply I to the equation 
(2) putting n = 1 ; then we have the ordinary theorem of Rolle. 

THEOREM II. There exists one and only one integral H ( x ) of the homogeneous 
equation (1) assuming n given values at n given points of the interval (a, b) . 

Theorem II asserts the possibility of a certain interpolation; in the case of 
equation (2) and if the n points are distinct, it asserts the possibility of 
solving the problem which is solved explicitly by Lagrange s formula. 

THEOREM III. Determine an integral H (x) of the homogeneous equation (1) 
assuming the same values asf(x) at n given points of (a, b); determine further 
an integral N ( x ) of the non-homogeneous equation 



that vanishes at the n points in question. There exists a point % intermediate 
between these n points and an arbitrary point x of (a, b) such that 



f(x) = H(x) + N( 

Take the equation (2) and n coincident points; in this particular case 
Theorem III gives Taylor s formula with Lagrange s remainder-term. 

In all the theorems just stated coinciding points are admissible; this will be 
fully explained in 1. 2 enumerates some useful formulas concerning 



314 GEORG POLYA [December 

Wronskians. The proofs of Theorems I, II, III and a few illustrative ex 
amples are given in 3-5. In 6 I try to justify the definition of property 
W by proving the converse of Theorem II. 7 deals with a system of several 
functions of one variable and generalizes Rolle s theorem in another manner. 
1. The f unction /( x ) is supposed differentiate n times. We say that /(a;) 
vanishes at k points coinciding with .TO if 

(G) /(.TO) =/ (.r ) = =/&lt;*- &gt; (.r ) = 0. 

This statement has a definite meaning if k ~ n + 1 . If we say that / ( .r ) 
vanishes at k and no more points coinciding with XQ, then we add to (6) the 
condition 



and we suppose k ^ n. If /(.r) vanishes at 

(7) /;/,, w 2 , m 3 , , mi 
points coinciding with 

(8) xi, x 2 , x 3 , , xi 
respectively where 

(9) a &lt; xi &lt; .1-2 &lt; x 3 &lt; &lt; xi &lt; b 

then we say that/(.r) vanishes at mi + m z + + mi points of the interval 
(a, b ) . If we say that/(.r) and (a 1 ) assume the same values at n points of 
( a , b ) we mean that / ( x ) &lt;/&gt; ( x ) vanishes at n points of ( a , b ) . Theorem 
II may be stated more explicitly thus: if there are given I points (8) satisfying 
(9), / positive integers (7) such that ?HI + t 2 + + nil == n, and finally 
I systems of values 

yi, y(, y i, , yT^, 
yt, y z, y", -, y ( ?*~ l) , 

i, 11" ,,(&gt;;- 1) 

yi&gt; Vit Vn &gt; yi &gt; 

then there exists an integral // (a,-) of (1) satisfying the conditions 

#(3-0 == #! , H ( xi ) = yj , , ff(i- ( Xl ) == y?"-", 

H(x t ) - H (x,} -=y 2 , //&lt;^&gt;(.r. 2 ) = 

H(x l }=y l , H (x l }=y l , , 



and H (x} is completely determined by these conditions. 

A point satisfying the inequality ,ri &lt; &lt; xi is said to be intermediate 
between the points (8) ; so far we have supposed / &gt; 1 . If I = 1 , that is, if 
all the points in question coincide with one point xi , then = .vj is the only 
intermediate point. 
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I enumerate a few evident propositions used in what follows. The functions 
u, tii, , Uk are supposed continuous and differentiate as many times as 
required to calculate the expression 



(11) 

d_ d_ _rf_ d_ d k f 



1- #/(* ) vanishes at k (and no more) points coinciding with x , then f (x) 
vanishes at k -- 1 (and no more) points coinciding with x . 

2. Iff(x) vanishes at k points coinciding ivith x , then u ( x )f( .r ) vanishes at 
k (or more) points coinciding with .TO . 

3. Suppose f ( x ) vanishes at k and no more points coinciding with x , and 
suppose further that u,ui,u 2 , , u k are different from zero for x = x . Then 
the expression (11) is ^ for x = .TO. 

4. Suppose f(x) vanishes at k + 1 points. Then, between these k + 1 points, 
there exists an intermediate point % at which the expression (11) vanishes. 

We obtain 4 by repeated application of Rolle s theorem and of Theorems 
1,2. 

2. Let ci, cz, , c n -i be constants; then we have 

(12) W ^ l f- " f H ~ l Cl $ l + C ~ f~ + " + Cn-lfn-l + fn ) 

= TH/1,/2, , /-!,/). 

Suppose the function u is differentiate n -- 1 times; then we have 

W(uf lf uf 2 , -,?//) = 7tIT T (/i,/2, ,/). 
In particular put u = !//:; we obtain 



For the derivative of a Wronskian we have 



Using (14) we obtain by the usual formula for- a minor of the adjoint de 
terminant 

W(fi, ..-,/ n _ 2 ,/ n _ 1 )IP(/ 1 , ...,/_,/) 

- W(f lt ,f n -,,f n -i)W(f l , ..-,/ n _ 2 ,/ n ) 

= W(f lt .-.,/ n _ 2 )ir(/ 1 , ..-, / n _ 2 ,/-l,/n), 

d W(fi, ..,/ B _ 2 ,/ B ) 

^^(/!, -..,/^ 2 ,/ B _ 1 ) 
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If hi (x) , h 2 (x) , , h n (x) are n linearly independent integrals of (1) , we 
have 

r _ W(hi, kt, , h n ,y) 

* ^ = TT/ / . - 1~ i \ 

W ( hi , h 2 , , h n ) 



(17) W(ln,ht, -,*) =ce 
where c is a constant. Put 

y= Fo, H- (*i,y) = Y lt 

H (Ai, /J 2 , j/) = Y z , , W(hi, h*, , h n , y) = Y n , 

l = W 0t h l = W l , 

W(h l ,h 2 ) = tt z , , W(hi,ht f --,hn) = W n . 

Applying (15) we obtain 

d_Y WpYi (t^Y^ W\Y t d Yn-i ^ W n ^ Y n 

dxWi~ W\ dxW~ W\ dx W n Wl 

Combining these formulas with (16) we find finally 

risi L Wn d WLl d wl d wl d y 

~ W n ^ dx PF n _ 2 W n dx U\ Ff 8 dx W W 2 dx W\ 

(18) is the usual decomposition of a linear homogeneous differential expression 
into a "product" of differential expressions of the first order. The proof here 
given is perhaps more direct than the usual.* 

3. The property W implies the non-vanishing of W\ , TT 2 , , W n -i appear 
ing in formula (18). W n vanishes nowhere, by virtue of (17). Applying 
Theorem 4 of 1 to the decomposition (18) we get at once Theorem I. 

Apply now Theorem I to/(.r ) ~ H (x) instead of /(.r) , where H (x} is an 
integral ef the homogeneous equation (1). Then LH (x) = and we obtain 
the corollary 

THEOREM !. // the function f (x ) , supposed differentiate n times, assumes 
at n + 1 points of the interval (a , b) the same values as an integral of the homo 
geneous equation (1) , there exists an intermediate point % such that 

Af() =/ (n) m+&lt;M)/ ( "- 1) U t ) + +$()/() =0. 

Theorem I* was incidentally remarked by H. Poincar-ef in the ease of an 
equation of the second order where the property W is particularly simple, 
including only one inequality. Poincare s proof is very different from the 
prwf given here and more complicated. I obtained the results presented in 

* Cf. L. Schlesinger, Handbuch der Theorie der Unearen Differentialgleickungen, vol. I, p. 52, 
formula (14). 

fL Intermediaire des Mathematicians, vol. 1 (1894), pp. 141-144 and 
also pp. 69, 127, 172, 216. 
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this paper in clearing up and working out the original remark made by Poincare. 

Another corollary to Theorem I is the following 

THEOREM I**. No integral N (x) of the non-homogeneous equation (5) 
vanishes at more than n points of the interval (a, b ) . 

Suppose, if possible, that N (x) vanishes at re + 1 points; then we have 
by Theorem I 

LN(i-) = 



at an intermediate point ; now, actually, LN (%) = 1 and thus Theorem 
is proved. 

4. The general integral of (1) has the form 

// ( .r ) = d hi ( .r ) + c 2 h z ( x ) + + c n h n ( x ) , 

where c\, c 2 , , c n are constants. The conditions (10) form a system of n 
linear non-homogeneous equations with the aid of which we have to determine 
Ci , 02 , , c n We have to prove that this system is neither incompatible 
nor indeterminate. Thus we have to prove that the only solution of the 
homogeneous system obtained from (10) by putting 

(19) yi = y( = y", = = yT ~ l} = 

is the identically vanishing solution. 

Suppose, if possible, that there is a solution of the homogeneous system (10) 
(19) which does not vanish identically; denote it by 

H(x) ==7iMz) +72 M*) + + 7* **(*) + " + 7M*) 
where 



j k ^ , 7;t + i = jk+2 = = 7n = . 

Then h k (x) assumes at n ^ (k -- 1) + 1 points the same values as an 
integral of the equation 

W(hi, h 2 , , h k -i, y) = 
W(h lt h 2 , ---,^-1) 

of order k 1 . Then there exists, by virtue of I*, an intermediate point 

at which 

W(h l (^,h 2 (^), -, A-i($),M)) =0. 

This last equation is excluded if we assume the existence of the property W 
and thus Theorem II is proved. 

Example. Assume that the algebraic equation 

X n + Ci X n ~ l + + C n = 

has re different real roots GI , a 2 , , a n , where 
(20) i &lt; 2 &lt; 3 &lt; &lt; a 
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Then the linear homogeneous equation with constant coefficients 



has the integrals 

(21) e a ^, e a **, 

and possesses the property W in the interval ( - o , -f oo ) since the Wron- 
skians 

e a &gt; x , IV ( e-i* , e a * r ) , -, W (e a x , e a * x , , e a " x ) 
divided by 



respectively reduce to positive constants which may be calculated by Vander- 
monde s formula. Therefore we can interpolate by a suitably chosen linear 
combination of the n integrals (21) n arbitrary values at the n points .ri , ,r 2 , 
, x n where 
(22) r &lt; i o &lt; r &lt; &lt; r 

This fact is equivalent to the statement that 
(23) 



Thus we have proved (23) as a particular case of Theorem II. It may be 
shown directly in a slightly different manner that the determinant (23) is 
&gt; if (20) (22) are satisfied. I add that the property W belongs to any 
equation whatever with constant coefficients in the interval ( - x , + x ) 
if the roots of the corresponding algebraic equation are all real, whether disthu-f 
or not. We get a simple proof of this from (17). 

Any integral N ( x ) of the non-homogeneous equation (5) is of the form 

N(x) = N (x) +#(*). 

Here N (x) is a particular integral of the non-homogeneous equation (5) 
and H (x) is the general integral of the homogeneous equation (1); H (x) 
contains n arbitrary constants which we can adapt to any conditions of the 
form (10), by virtue of Theorem II. Thus we obtain the corollary 

THEOREM II*. There exists one and only one integral N (x} of the non- 
homogeneous equation (5) assuming n given values at n given points of the interval 
(a,6). 

5. I now prove Theorem III. The possibility of determining // ( ,r ) accord 
ing to the enunciated conditions follows from Theorem II and the possibility 
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of determining A T (.r) follows from II*. Let x be a point of (a, b) but not 
one of the n points mentioned at which /(.r) - H (x) and A 7 (.r) vanish. 
Then we have, by Theorem I**, N (x ) ^ 0. Therefore it is possible to 
find a constant. C satisfying the equation 

/ O 4 \ f 

Thus the function 

/(*) -//(.r) - CN(x) 

vanishes at n + 1 points of (a, b) . There exists, by Theorem I, an inter 
mediate point at which 



) "#() ~CN(U) = i/() -O-C-1 =0, 
that is, 



Substituting this value in (24) we have Theorem III.* 

I will now give some concrete cases of Theorem III with immediate appli 
cations. 

Example 1. Let Xi be the point at which we interpolate the differentiable 
function / ( x ) by an integral of the equation 

y + y = 0. 

Adopting the notation of Theorem III we have 

// ( A- ) = / ( a-i ) g i- , N(x)*-l - e x &gt;~* , 
and consequently for .r &gt; .i i 
(25) /(.r) = f(x 1 )tf^ + (l - e *i-*)(/() +/ ()) 

where .ri &lt; &lt; .r . From (25) we get at once the proposition f 
Suppose f ( x ) differentiable for x &gt; 0; &lt;/ze 

lim (/(*)+/ (*)) =c 

i &gt;+ 

implies lim / ( .r ) = c . 

We may assume c = (and apply the proof to/(.r ) c instead of to/(.f) ) . 
Choose first .ri sufficiently great so that \f(x) +/ (* )( &lt; e/2 for x &gt; A; 
then choose .r 2 , a - 2 &gt; .TI , such that / ( #1 ) e xi ~ x - &lt; 6/2 . Finally we get by 
(25) for .r &gt; ,r 2 



* The particular case corresponding to (2) was given by Ch. Hermite for points coinciding 
in an arbitrary manner. See (Euvres, vol. Ill, pp. 432-443, and T. J. Stieltjes, (Euvres, vol. I, 
pp. 47-60. For a more general point of view see G. D. Birkhoff, these Transactions, 
vol. 7 (1906), pp. 107-136. 

f G. H. Hardy, Quarterly Journal of Mathematics, vol. 35 (1903), pp. 
31-33. The proof given by O. Perron, Mathematische Zeitschrift, vol. 6 
(1920), p. 159, supposes more than the derivability of f(x) . 
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By the same argument we may prove an analogous result for any equation 
with constant coefficients if the roots of the corresponding algebraic equation 
are real and negative.* 

Example 2. In dealing with the equation 

(4) y" + y = 

we may choose for (a , b ) any interval of length TT . The two points at which 
we interpolate may be different or not. Interpolating at two distinct points 
x\, 0*2, where 

(26) 3-1 &lt; X-i &lt; Xi + 7T, 

we find the formula 

f , x _f(xi) sin (x z - x) + f(x 2 ) sin (x .TI) 

rf \ / " / \ 



(27) 



valid for 0*2 TT &lt; x &lt; #1 + TT; ^ is an interior point of the least interval 
containing x\ , x 2 , x . Interpolating at two points coinciding with x\ we find 



sin x z 




(28) X = Xl cos x ~ Xl + &gt;Tl sn x ~ -ri 

+ (1 --cos (*-*!))(/"({) +/(!)) 

valid for a-i TT &lt; x &lt; x\ + TT; lies between a i and x. We see that restric 
tion (26) is essential for the validity of (27), ,r 2 = .Vi + TT being generally in 
admissible; this point will be cleared up by Theorem IV. 

In order to give an application suppose /(.T) is an integral of the equation 

(29) y"+*(*)y = o, 

where 

(30) (*)&gt;!; 
suppose further that f(xi) &gt; . We have by (29) 



thus we may put (28) in the form 

(31) /(.T) =^sin(a;-o) -- (1 - cos (a- - .r 1 ) 

* 0. Perron, Mathematische Zeitschrift, vol. 6 (1920), pp. 161-163. 
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where 

(32) A sin (an - a) = /(a?i) &gt; 0, A cos (an - a) = f (an) . 

The situation shown by (30) (31) (32) is this: the curve A sin (x - a) 
meets the curve /(a) at the point x lt see (32); the curve A sin (x - a) runs 
on the upper side of the curve /(x) as far as/(x) runs on the upper side of the 
ar-axis, see (31) (30); but the curve A sin (a; - a) cuts the ar-axis at two points 
between which .TI lies and whose distance apart is TT; hence /(a-) reaches the 
ar-axis necessarily before A sin (a; - a ) cuts it and we have the theorem 

The distance between two consecutive zeros of any integral of the equation (29) 
satisfying the condition (30) is &lt; TT . 

This is a particular case of a well known classical theorem given by Sturm* 
that can be proved generally by the argument I have used in the foregoing 
concrete case. 

Any integral f(x) of the equation (29) has the following property: the 
inequality 

(33) /(.r) &gt; 
implies 

Consider the curve of which the equation is 

(35) r= /(V) 

in a system of polar coordinates r , d with origin . The inequality (34) 
expresses that the curve (35) appears from to be convex, as we may see, e.g., 
by (27). The result just proved may be formulated geometrically as follows: 

A curve appearing from a given point to be everywhere convex subtends an 
angle &lt; TT at . 

H. Poincare was led to the remark above mentioned ( 3) by constructing 
an analytical demonstration of this geometrically evident fact. 

6. The converse of Theorem II is also true: if the interpolation of any n 
values by an integral of (1) is possible at any n points of an interval, then the 
equation (1) possesses the property W in that interval. The possibility of 
interpolation, that is, the existence of a solution of the non-homogeneous 
system (10) with arbitrary y\, y(, , y ( l ~ l) , is equivalent to the non-existence 
of a non- vanishing solution of the corresponding homogeneous system; we 
used this equivalence in 4. Thus the most simple form of the converse to 
Theorem II is the following: 



* Cf. Maxime Bocher, Methodes de Sturm, Paris, 1917, p. 52. 
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THEOREM IV. //, except the identically vanishing intc&lt;jr&lt;il, there exists no 
integral of equation (1) vanishing at u jininfn &lt; if flic interval a s x &lt; b, then there 
exist n -- 1 integrals hi (x) , /; 2 (x) , , h n -i (x) such that the HVo/j.sY,-///.v 
(36) hi(x), W(h l (x),h 2 (x)), -, lV(hi(x),h,(x), , h n -i (.r) ) 
vanish noichere in the interval a &lt; x &lt; b . 

The signs of the Wronskians are of no importance provided that they are 
constant. If the Wronskians (30) do not vanish we can make them positive 
by substituting /( (.r) for /* (x) for certain values of v. 

Determine n integrals hi(x), A 2 (.r), , h n (x) satisfying the initial 
conditions 

fci (o) h{ (a) --= h [ (a) = = h ( r 3} (a) == // r 2 (a) == 0, 



(37) h z (a) = K (a) = h" (a) -- = //J 1 - 3 (a) = 0, //,?- 2) (a) 1, 
//,,_ 2 (a) == //U (a) == 0, /i_ 2 (o) 1, 

/;_! (a) = 0, A^_i () == 1, 
/ ,, (a) == 1. 

These integrals are linearly independent because their Wronskian = 1 
for .T = a; hi (x) vanishes at n -- 1 points coinciding with a; by hypothesis 
there exists no th point in the interval a Si x &lt; b at which /?i (.r) vanishes; 
therefore /h (.r) ^ for a &lt; .r &lt; b. Adopt the notation 

W(h 1} h z , -,h k ) ---- W k 
that we used in 2 and suppose that we have already proved 

(38) Wi^O, H o^O, -, W k *0 

f or a &lt; .r &lt; b , k &lt; n . 

Let c\, c-i , , Ok be constants and put 

ci /(i ( .r ) + c 2 M aO + + c* /u- (.r) + /J*+i (.r) = A ( x ) . 

/^ (.r) does not vanish identically, and therefore it vanishes at no more than 
n -- 1 points contained in the interval a ^= x &lt; b. Xow h (x) vanishes at 
n k -- 1 points coinciding with a by virtue of the conditions (37). De 
termine the constants c\ , c 2 , , Ck in such a manner that h ( x ) vanishes at 
k points coinciding with .r where a &lt; .T O &lt; b; this is possible because the 
determinant of the A" equations in question 



by the hypothesis of complete induction, that is, by (38). Beyond the 
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zeros already mentioned, h ( x ) has no other zeros for a &lt; x &lt; b . In particular 
h (.1- ) vanishes at no more than k points coinciding with x . Consequently, by 
virtue of hypothesis (38) and of the Theorem 3 of 1, 



(39) n 




, h k ,h k+ i) = W t+1 

does not vanish for x = x ; we obtain (39) from (15) (12) in the same manner 
as we obtained (18). The point .T O was arbitrarily chosen in the interval 
a &lt; x &lt; b and thus Theorem IV is proved. 

7. Another demonstration of the Theorems I, II, III essentially different 
from that given in 1-5 is based on the following theorem, 

THEOREM V. Suppose all the functions hi (x) , h 2 (.r) , , h n (x) , f(x) 
possess derivatives of order n throughout the interval a &lt; x &lt; b and satisfy the 
following n inequalities: 

hi (.r) &gt;0, W(hi(x),h z (x)} &gt;0, 

W(hi(x),h z (.r), -,*(*)) &gt;0. 

Suppose also that the I points x\ , x 2 , , x t and the corresponding positive integers 
mi , m 2 , , mi satisfy the conditions 

a &lt; xi &lt; x- 2 &lt; &lt; xi &lt; b, mi + m 2 + + m { = n + 1 . 

Consider the deter mi mint of order n + 1 

,h k C-i- ( xi ) , A t (z, ) , h k ( x, ) , 

, A* ""- 1 (a:,), , A* (.r,), , A* ""- ) (.r,)| , 

the n first rows being obtained by putting successively k = 1 , 2, 3, , n and 
the last row by substituting f ( x ) for h k ( x ) . 

There exists an intermediate point , xi &lt; &lt; x ; , such that the value assumed 
by the Wronskian at the point , that is, 



is &gt; , = or &lt; , according as the determinant (40) is &gt; , = or &lt; . 

Theorem Y may be compared to a well known determinantal theorem of 
H. A. Schwarz.* Theorem V may be proved by complete induction; its 
validity for the ( n - - 1 ) + 1 functions 

* H. A. Schwarz, Abhandlungen, vol. II, pp. 296-302; T. J. Stieltjes, (Euvres, - vol. II, pp. 
110-123. The determinant (40) ha.s been considered by H. M. Morse, G. A. Pfeiffer, and 
G. M. Green from a different point of view; see Bulletin of the American Mathe 
matical Society, vol. 23 (1916), pp. 114-122. 
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/M)V /M)V (h n (x)\ /7(*) V 

\ki(x)J VM*)/ UUV VM*)/ 

implies its validity for the n + 1 functions li\ ( x ) , A* ( .r ) , -,A n (),/(), 
as may be shown by (13) and by Rolle s theorem. I had originally based my 
demonstration of Theorems I, II, III on Theorem V. I was led to the treat 
ment of the subject I finally adopted by a kind remark made by Professor H. 
Weyl. 

EIDG. TECHNISCHE HOCHSCHULE, 
ZURICH, SWITZERLAND. 



ERRATA, VOLUME 24 

J. F. RITT, On algebraic functions which can be expressed in terms of radicals. 
Page 21, lines 30 and 33, for "n 2 " read "n". 



